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Preface 



The 26th International Workshop on Graph-Theoretic Concepts in Computer 
Science (WG 2000) was held at Waldhaus Jakob, in Konstanz, Germany, on 15- 
17 June 2000. It was organized by the Algorithms and Data Structures Group 
of the Department of Computer and Information Science, University of Kon- 
stanz, and sponsored by Deutsche Forschungsgemeinschaft (DFG) and Univer- 
sitatsgesellschaft Konstanz. 

The workshop aims at uniting theory and practice by demonstrating how 
graph-theoretic concepts can be applied to various areas in computer science, 
or by extracting new problems from applications. The goal is to present recent 
research results and to identify and explore directions for future research. The 
workshop looks back on a remarkable tradition of more than a quarter of a 
century. Previous Workshops have been organized in various places in Europe, 
and submissions come from all over the world. 

This year, 57 attendees from 13 different countries gathered in the relaxing 
atmosphere of Lake Constance, also known as the Bodensee. Out of 51 submissi- 
ons, the program committee carefully selected 26 papers for presentation at the 
workshop. This selection reflects current research directions, among them graph 
and network algorithms and their complexity, algorithms for special graph clas- 
ses, communication networks, and distributed algorithms. The present volume 
contains these papers together with the survey presented in an invited lecture by 
Ingo Wegener (University of Dortmund) and an extended abstract of the invited 
lecture given by Emo Welzl (ETH Zurich). 

It is my pleasure to thank all the people whose contributions made WG 2000 
a successful event: the authors of papers for submitting their work and presen- 
ting accepted papers at the workshop; the invited speakers for their remarkable 
lectures; all reviewers for their careful reports; the members of the program com- 
mittee for their timely evaluations and intensive discussion during the selection 
process, and in particular Peter Widmayer for coming to Konstanz for the final 
discussion; the DFG, the Universitatsgesellschaft Konstanz, and Springer- Verlag 
for financial support; the Friedrich-Naumann-Stiftung for funding the organiza- 
tion of the workshop at Waldhaus Jakob and Andrea Stern for supporting us 
during the workshop. 

Special thanks go to Sabine Gornelsen, Dagmar Handke, Annegret Liebers, 
Barbara Liithke, and Thomas Willhalm. They were responsible for the excellent 
preparation and perfect organization of the workshop. Furthermore, I would like 
to express my gratitude to Ulrik Brandes. While I had only to make decisions ab- 
out insignificant details, he had “a Anger in every pie,” including the organization 
of the workshop, the electronic submission and discussion, and the preparation 
of this volume. 



Konstanz, August 2000 



Dorothea Wagner 
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On the Expected Runtime and the Success 
Probability of Evolutionary Algorithms * 
(Invited Presentation) 

Ingo Wegener 

FB Informatik, LS2, Universitat Dortmund, 44221 Dortmund, Germany. 
wegener@ls2 . cs . uni-dortmund . de 



Abstract. Evolutionary algorithms are randomized search heuristics 
whose general variants have been successfully applied in black box opti- 
mization. In this scenario the function / to be optimized is not known 
in advance and knowledge on / can be obtained only by sampling search 
points a revealing the value of f{a). In order to analyze the behavior of 
different variants of evolutionary algorithms on certain functions /, the 
expected runtime until some optimal search point is sampled and the 
success probability, i.e., the probability that an optimal search point is 
among the first sampled points, are of particular interest. Here a simple 
method for the analysis is discussed and applied to several functions. For 
specific situations more involved techniques are necessary. Two such re- 
sults are presented. First, it is shown that the most simple evolutionary 
algorithm optimizes each pseudo-boolean linear function in an expected 
time of O(nlogn). Second, an example is shown where crossover decrea- 
ses the expected runtime from superpolynomial to polynomial. 



1 Introduction 

Evolutionary algorithm is the generic term for a class of randomized search heu- 
ristics. These search heuristics are investigated and applied since the mid-six- 
ties and more intensively since the mid-eighties (Fogel (1985), Goldberg (1989), 
Holland (1975), Rechenberg (1994), or Schwefel (1995)). Despite many successes 
in applications people working in the area of efficient algorithms (including graph 
theoretic methods) have ignored evolutionary algorithms. The main reasons for 
this are the following ones. Many papers on evolutionary algorithms are quite 
imprecise and claim too general results which then are not proved rigorously. 
In particular, general evolutionary algorithms are mixed with hybrid algorithms 
for special problems which use ideas from efficient algorithms for the considered 
problem. Finally, an analysis of search heuristics for the most often considered 
problems is often too difficult. One of the few successful approaches is the ana- 
lysis of the Metropolis algorithm for the graph bisection problem by Jerrum 
and Sorkin (1998). In order to understand the complicated stochastic process 

* This work was supported by the Deutsche Forschungsgemeinschaft (DFG) as part 
of the Gollaborative Research Genter “Gomputational Intelligence” (531). 



U. Brandes and D. Wagner (Eds.): WG 2000, LNCS 1928, pp. 1-10, 2000. 
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behind evolutionary algorithms and related search heuristics like simulated an- 
nealing, Rabani, Rabinovich, and Sinclair (1998) and Rabinovich, Sinclair, and 
Wigderson (1992) have investigated isolated features of these algorithms. These 
results are fundamental for a well-founded theory but they do not contain any 
results on the behavior of evolutionary algorithms when applied to certain spe- 
cific functions. These papers have been ignored in the community working on 
evolutionary algorithms. 

We strongly believe that the analysis of randomized search heuristics is an 
interesting area which should be part of the area of efficient algorithms. 

In order to distinguish general evolutionary algorithms from specialized ones, 
we introduce and discuss in Section 2 the scenario of black box optimization. 
In Section 3, we present those variants of evolutionary algorithms which are in- 
vestigated in this paper. In Section 4, we discuss a general and simple method 
to analyze the behavior of evolutionary algorithms on certain functions. This 
method leads in surprisingly many situations to quite precise results. In order to 
prove that a variant of evolutionary algorithms may have a specific property, it 
is useful to look for example functions where this method of analysis works. Ho- 
wever, this method has a limited power. For specific problems one has to develop 
specific tools which hopefully will turn out to be useful in further situations. In 
Section 5, we prove that the simplest evolutionary algorithm is efficient on all 
pseudo-boolean linear functions and, in Section 6, it is proved that crossover, 
although unnecessary for many problems, can decrease the expected runtime of 
an evolutionary algorithm from superpolynomial to polynomial. 



2 Randomized Search Heuristics in Black Box 
Optimization 

The scenario in the area of efficient algorithms is the following one. We are 
concerned with a specific and well-defined optimization problem. The structure 
behind this problem is investigated and this leads to a specialized algorithm 
which then is analyzed and implemented. 

However, there are situations where this scenario is not adequate. If a problem 
does not belong to the class of problems investigated in textbooks or scientific 
papers, people may not have the time, skills or other resources to develop a 
specialized algorithm. Moreover, the functions / to be optimized may not be 
given in a clearly structured way. In order to optimize a technical system it may 
be the only possibility to perform or simulate an experiment to evaluate / on a 
specific input a. In these situations one needs randomized search heuristics with 
a good behavior on typical problems. 

Such situations are captured by the scenario of black box optimization. The 
problem is to maximize an unknown function / : S' — >■ R. Here we only consider 
pseudo-boolean functions / : {0, 1}" — >■ M. The algorithm can choose the first 
or more generally the t-th search point based only on the current knowledge 
namely the t — 1 first search points together with their /-values. 
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This scenario has some unusual features. The randomized search strategy is 
not developed for a specific function. Hence, there will be no strategy which is 
superior to all the other ones. The aim is to develop a strategy with a good beha- 
vior on many problems, in particular, those problems expected to be considered 
in applications. 

Moreover, it should be obvious that algorithms designed for a special class 
of functions should be better than a search heuristic for black box optimization 
when applied to the same class of functions. Since black box optimization is a 
typical scenario in applications, the design and analysis of randomized search 
heuristics is well motivated. The results reveal some knowledge on the type of 
problems where the given heuristic has good properties. 

In black box optimization, the algorithm does not “know” if the best point 
ever seen is optimal. Hence, a stopping criterion is applied. After the algorithm 
has been stopped, the best of the sampled search points is presented as a so- 
lution. We abstract from this problem and consider search heuristics as infinite 
stochastic processes. We are interested in the random variable TAj describing 
for the algorithm A and the function / the first point of time when an optimal 
input is sampled. The expected runtime -Raj of A for / is the expected value 
of Taj- Another important parameter is the success probability Saja that A 
applied to / has sampled an optimal input within the first t steps. It is possible 
that Raj increases exponentially while the success probability for a polynomial 
number of steps is not too small (constant or l/poly(n)). Then a multistart 
variant of A is efficient for /. 

3 Some Variants of Evolutionary Algorithms 

The most simple variant of an evolutionary algorithm is the so-called (l-l-l)EA 
working with a population size of 1 and, therefore, without crossover. 

The first search point x is chosen randomly in {0, 1}”. This search point x 
is also the actual one. Later, a new search point x' is obtained from the actual 
one by mutation, i.e., the bits x[ are considered independently and x' differs 
from Xi with the mutation probability Pruin). The default value of Pruin) equals 
1/n ensuring that the expected number of flipping bits equals 1. The new point 
x' becomes the actual one iff f{x') > f{x). Otherwise, x remains actual. The 
notation (1-1-1) describes that we select among one parent and one child. It will 
turn out that Pmin) = Ijn often is a good choice but sometimes other values are 
better. There are methods to find a good value of Pm(n) during the search process 
(Back (1993, 1998)). However, a quite simple dynamic version called dynamic 
(H-l)EA turns out to have some good features (Jansen and Wegener (2000)). 
The first mutation step uses the parameter Pm{n) = 1/n. Before each new step 
Pm{n) gets doubled and a value larger than 1/2 is replaced immediately by 1/n. 
This leads to phases of approximately logn steps where for each meaningful 
mutation probability p one step uses a mutation probability which is close to p. 

The analysis of evolutionary algorithms without crossover is much easier than 
the analysis in the presence of the crossover operator. Although many people 
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have claimed that crossover is a useful operator, Jansen and Wegener (1999) 
were the first to prove this for some well-chosen function (see Section 6). They 
work with a population size of n, i.e., the algorithm stores n search points and 
their /-values. The first n search points are chosen randomly and independently. 
With probability Pc(n), the algorithm chooses randomly and independently two 
parents x' and x" from the current population. Uniform crossover leads to a 
random child z where the bits Zi are chosen independently and Zi = x\ if x\ = x'(. 
Otherwise, Zi = x[ with probability 1/2 and Zt = x'[ otherwise. Then y is the 
result of mutation with parameter 1/n applied to 2 ;. With probability 1 —pc{n), 
y is the result of mutation with parameter 1 /n applied to a random parent from 
the current population. Let a: be a random element of those elements of the 
current population which have the smallest /-value. The new element y replaces 
X iff f{y) > f{x) and y is not a replication of (one of) its parent(s). 

4 A Simple Method for the Analysis of Mutation Based 
Evolutionary Algorithms and Some Applications 

Let A,B C {0, 1}”. Then A <j B if f{x) <f (y) for all a; G 4 and y £ B. Let 
Ai,... ,Ap be a partition of the search space {0,1}” such that Ai <j A 2 <f 
... <f Ap and Ap is the set of all optimal search points. 

For X £ Ai let s{x) be the probability that mutation changes x into some 
y £ Aj where j > i. and let s{i) = min(s(a;)|a; £ Ai}. The expected time to 
leave Ai has an upper bound of r(z) = l/s(z). Hence, the expected runtime of 
the (l-l-l)EA on / is bounded above by the sum of all r(i), 1 < i < p — 1. The 
bound can be improved by taking into account the probabilities 7 t(z) that the first 
search point lies in Ai. It is essential to choose a “good” partition of (0, 1}”. The 
number of sets should not be too large but, nevertheless, the probabilities s(z) 
should not be too small. More involved applications of this technique consider 
also the probabilities that the search may omit some sets of the partition by 
jumping from Ai to Ai+d for some d > 2. 

We look for an upper bound on the success probability Sf^ of the (U-l)EA 
which implies a lower bound of (1 — Sf^) ■ t on the expected runtime. We have 
no success within t steps if 

• we start with a search point in Ai U • • • U A^, 

• no jump from some Aj to Aj^d U • • • U Ap takes place, 

• and less than {p — i)/{d— 1) times a change of the A-set takes place. 

Hence, we have to prove that long jumps have a very small probability and 
that s*(z) = max{s(x)|x £ Ai} is small. (It is easy to generalize the upper and 
lower bound technique to other mutation based evolutionary algorithms.) 

In the following, we analyze the (l-l-l)EA on several functions. A function 
/ : {0, 1}” — >■ R is called unimodal if each non-optimal search point x has a 
neighbor y (H{x,y) = 1 for the Hamming distance H) such that f{y) > f{x). 
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Theorem 1. Let f : {0,1}” — >■ R 6e unimodal and let I{f) be the size of the 
set image{f) = {f{x)\x G {0,1}”}. The expected runtime of the (1+1)EA is 
bounded above by e ■ n ■ (I{f) — 1). 

Proof. We partition {0, 1}” into the sets of inputs with the same /-value. Because 
of the unimodality of / there is for each non-optimal x a mutation changing one 
bit of X and leading to an improvement. Hence, s{x) > ^(1 — ^ and 

r{i) < e • n for all non-optimal H-sets. □ 

The function LO„ (LEADING ONES) counts the length of the longest prefix 
of X consisting of ones only. LO„ is unimodal and /(LO„) = n-l- 1. The upper 
bound of Theorem 1 for LO„ equals e- v? . 

Theorem 2. The expected runtime of the (1+1)EA on LOn equals Ofnf). 

Sketch of Proof. (Droste, Jansen, and Wegener, 2000). With a probability of at 
least 1 — ^ the initial search point has at most log n leading ones. The probability 
that the /-value is increased during one step equals 1/n, since one specific bit 
has to flip. The suffix of the actual point x behind the prefix 1*0 is a random 
string. Hence, the /-value increases in a successful step with probability (1/2)^ 
by J + 1. The result follows by an appropriate application of Chernoff bounds 
(Hagerup and Riib (1989)). □ 

A function / : {0, 1}” — >• R is called linear if f{x) = wiX\-\ \-WnXn for some 

weights. The linear function ONEMAX„ is defined by = • • • = Wn = 1 and 
the linear function BV„ (BINARY VALUE) by Wi = 2"“*. All linear functions 
are unimodal. The following results have been proved by Miihlenbein (1992), 
Rudolph (1997b), and Droste, Jansen, and Wegener (1998). 

Theorem 3. The expected runtime of the (1+1)EA on a linear function f where 
Wi ^ Q for all i is bounded below by l7(nlogn). It is bounded above for all linear 
functions by 0(n^) and by O(nlogn) for ONEMAXn. 

Sketch of Proof. The lower bound follows by an application of the coupon collec- 
tor’s theorem (see Motwani and Raghaven (1995)). For the general upper bound 
we assume w.l.o.g. > • • • > > 0. Let Ai = {cc|wi -I- • • • -I- Wi-i < f{x) < 

-I- • • • -I- Wi\, 1 < i < n + 1. Then s(t) > ^ for i < n. For ONEMAX„ even 
s(t) > ’ since inputs from Ai have n+l — i neighbors with a larger function 

value. □ 

In Section 5, an upper bound of O(nlogn) on the expected runtime of the 
(l-l-l)EA for an arbitrary linear function is established. The simple method 
used in this section seems to be too weak for this purpose. Unimodality is a 
natural property of many functions. Hence, people have looked for unimodal 
functions where the (l-l-l)EA needs exponential time. Such a function has to take 
exponentially many different function values and it has to be unlikely that one 
jumps over many possible function values. Long path functions (Horn, Goldberg, 
and Deb (1994)) have the property that there is a path (Hamming distance 
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between a point and its successor equals one) of exponential length where the 
value of the function increases along the path. The values of points not on the 
path lead the (1+1)EA to the source of the path. Rudolph (1997a) has described 
explicitly paths which additionally have the property that for each i < 
and each point x on the path there is at most one successor on the path whose 
Hamming distance to x equals i. Rudolph (1997a) has applied Theorem 1 to this 
function. Droste, Jansen, and Wegener (2000) have obtained an asymptotically 
matching lower bound. They have proved that the (1+1)EA meets the path 
with probability at least 1/2 in the first half of the path and that with high 
probability it needs 17(d) improvement steps if its distance to the terminal of 
the path equals d. 

Theorem 4. There are explicitly defined unimodal functions where the expected 
runtime of the (1+1)EA grows exponentially. The success probability for some 
exponentially increasing time bound is exponentially small. 

The method also works for the dynamic (1+1)EA. If the mutation probability 
1/n is good, we expect not to loose more than a factor of O(logn). 

Theorem 5. The expected runtime of the dynamic (1+1)EA is bounded by 

• 0(n(log n)/(/)) for unimodal functions, 

• 6>(n^ log n) for LOn, 

• 0(n^ log n) for all linear functions, 

• O(nlog^n) for ONEMAXn. 

We omit the proofs due to Jansen and Wegener (2000). They also have shown 
that the dynamic (1+1)EA can be very useful. We explain their results not in 
detail. They have defined a function PJ„ (path and jump) with the following 
features. There is an island I of optimal points (all vectors with logn ones and 
only zeros at the first 2 logn positions), a path P of length n/4 from i”/4Q3n/4 
to 0” consisting of all 1*0"“*, and a bridge B of all x with n/4 ones. Outside 
/ U P U R the function PJ„ is defined such that one efficiently finds the bridge 
but not the island or the path. The PJ-value on the bridge increases with the 
number of ones among the first n/4 positions. Hence, we look for 1"/4 q 3”/4^ 
source of P. For this purpose, mutation probabilities of size 1/n are good. The 
same holds on the path from i"/4Q3n/4 jg ygj-y unlikely to find the island 

before having reached 0". Then a jump to a point in Hamming distance logn is 
necessary and mutation probabilities of size (logn)/n are good. 

Theorem 6. Let Pm{n) = jj q,(u) — >• oo or a(n) — >■ 0 for n — >■ oo, 

the probability that the (1 + 1)EA on PJn needs a superpolynomial number 
of steps converges to 1. If a{n) = c, the expected runtime is 0(n^+'* ln“^ n + 
j.jC-iogc-iogin 2 ^ w/izc/i is 0(n^'^®^) for c = 1/(4 In 2). The expected runtime of 
the dynamic (1 + 1)EA on PJn is O(n^logn) and the success probability is 
1 — if t > dn^logn for some constant d. 
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Until now we have seen examples where steps with bad values of Pm{n) have 
neither positive nor negative effects. However, there are examples where bad 
values of Pm{n) lead to a disaster. There is a function PWT„ (path with a trap) 
with a path P of polynomial length and a trap T. PWT„ takes its optimal value 
at the end of P, the value is increasing along the path but the values in T are 
second best. All values outside PUT are smaller and are defined in such a way 
that it is very likely that PUT is reached at the beginning of P. For Pm{n) = 1/n 
it is very likely that we follow the path to the optimum without reaching the 
trap which is too far away (distance approximately logn). Using the dynamic 
(1 + 1) EA there are steps with mutation probabilities close to (logn)/n. Since 
the trap is large enough it is likely to reach the trap. The probability to return 
to the path is reduced to a jump to the unique optimum which is quite unlikely. 



5 The (1 + 1)EA on Linear Functions 

Theorem 7. The expected runtime of the (1 + 1)EA on a linear function is 
O(nlogn). 

Sketch of Proof . (Droste, Jansen, and Wegener (1998, 2000)). 

We have to improve the simple O(n^) bound of Theorem 3. W.l.o.g. Wi > W 2 > 

. . . > Wn > 0. It is possible that the Hamming distance from the unique optimum 
1" increases during the application of the (1 + 1)EA. E.g., for BV„ 10"“^ is 
better than 01"“^. The following technical trick is well-known from the analysis 
of efficient algorithms. The (1 -I- 1)EA works on some linear function / but 
we investigate its behavior w.r.t. to some other function, a so-called potential 
function. 

It has turned out that one linear function can serve as a potential function 
for all linear functions, namely the linear function val„ with the weights w\ = 
• • • = Wn /2 = 2 and w„/ 2 +i = • • • = = 1. Obviously, I{valn) = (3/2)n -I- 1. 

we are investigating the (1 -I- 1)EA working on /. A step is called successful if 
it changes the actual point. Let t{x) be the random number of successful steps 
until the (1-1- 1)EA on / starting at x samples for the first time a point x' where 
val„(a:') > val„(a;). The main step is the proof that E{t{x)) < c for a constant c 
which does not depend on x. Since val„ is close enough to ONEMAX„, this result 
implies the upper bound of O(nlogn) on the expected runtime by bounding the 
expected number of unsuccessful steps similarly as in the proof of Theorem 3 for 
ONEMAX„. 

The claim E{t{x)) < c is hard to prove. Let x £ {0,1}” and let x' be 
the random point produced by the (1 -I- 1)EA on / by a successful step. Let 
S be the random set of all i where Xi = Q and x' = 1. Then S ^ % and 
Ds{x) = val(x')— val(a;) describes for some fixed set S the random gain w.r.t. val. 
The random variable Ds{x) is difficult to analyze. Therefore, a random variable 
D*g{x) taking integer values is defined such that D*g{x) < 1 and Prob(I?g(a;) < 
r) > Prob(Us(a:) < r) for all r. Then the existence of a constant d > 0 (which 
neither depends on x nor on S) such that E{D*g{x)) > d is, proved. These 
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steps need an involved case inspection. Finally, a slight generalization of Wald’s 
identity is proved to obtain the result. □ 

This result indicates the difficulty of obtaining tight results on the expected run- 
time of simple variants of evolutionary algorithms on simple functions. However, 
typical tools used for the analysis of randomized algorithms have to be applied 
also for the analysis of evolutionary algorithms. 

6 The Hard Case — Evolutionary Algorithms with 
Crossover 

Evolutionary algorithms without crossover are much easier to analyze than those 
with crossover. Crossover is defined for two points. The success of crossover 
depends in the case of uniform crossover on the number and the position of those 
bits where the parents differ. For many functions, among them many of those 
discussed in this paper, crossover seems to be unable to improve evolutionary 
algorithms. Jansen and Wegener (1999) were the first to prove for a specific 
function the usefulness of crossover. Let JUMPm,n(a;) = ||a;||i = x\ + ■ ■ ■ + Xn 
if 0 < ||x||i < n — m or \\x\\i = n and let JUMPm,n(a;) = — ||a^||i otherwise. 
The set of x where JUMP^_„(a;) < 0 is called the hole. We are only interested 
in the case m = O(logn). A random population of polynomial size contains 
with overwhelming probability no optimal point and no point in the hole . If the 
selection procedure rejects points in the hole and if we only use mutation, an 
evolutionary algorithm needs with overwhelming probability steps for 

the optimization of JUMP^.n (e > 0 constant). 

Theorem 8. Let A he the evolutionary algorithm with crossover described in 
Section 3 and let Pc{n) = l/(nlogn). 

a) Let m he a constant and f = JUMP^.n- Then 

• SA,f,t = 1 ~ if t > CfeU^logn for some constant Ck, 

• Saja = 1 ~ if t > cn^ for some constant c, and 

• Raj = O(n^logn). 

b) Let m = O(logn) and f = JUMP^.n- Then 

• SAj,t = 1 — if t> c{n^ + n^(log® n -I- 2^™)) = poly(n) for some 

constants J > 0 and c, and 

• Raj = 0{n log^ n{n log^ n + 2^’”)) = poly(n) . 

Sketch of Proof. (Jansen and Wegener (1999)). 

With overwhelming probability, the evolutionary algorithm produces within 
0(n^ log n) steps a population of search points where each search point con- 
tains n — m or n ones. This can be proved by a generalization of the solution to 
the coupons collector’s problem. Crossover is not necessary for this but it also 
does not cause difficulties. 

Either we have found the optimum or we have a population of individuals 
with n — m ones. This property does not change in the future. Let us assume 
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(incorrectly) that the individuals are random and independent strings with n—m 
ones. If we choose two of them randomly, it is very likely that they have no 
common position with a zero. Then crossover creates with probability 2“^"* the 
optimal vector 1" and mutation does not destroy this string with probability 

The problem is that crossover and also mutation have the tendency to create 
positively correlated individuals. If all members of the population are equal (or 
each pair of individuals has many common zeros) , crossover cannot destroy these 
zeros and this job again has to be done by mutation. Hence, we need crossover for 
a quick jump over the hole but crossover decreases the diversity in the population. 
This is the reason why we assume that Pc(n) is very small. (It is conjectured 
that the theorem also holds for constant values of Pc{ti) but we cannot prove 
this.) 

Now a tedious analysis is necessary to ensure that with overwhelming pro- 
bability we have a population whose diversity is large enough. More precisely, it 
is proved that for a long period of time the following statement is true for each 
bit position i. The number of individuals with a zero at position i is bounded 
above by ^n. This property implies that, with probability at least 1/2, two 
individuals chosen randomly from the current population have no common zero. 
Since Pc{n) is small enough, we may assume during these calculations that, if 
crossover does not create an optimal individual, it only has bad effects. □ 

References 

1. Back, T. (1993). Optimal mutation rates in genetic search. 5th Int. Conf. on Genetic 
Algorithms (ICG A), 2-8. 

2. Back, T. (1998). An overview of parameter control methods by self-adaptation in 
evolutionary algorithms. Fundamenta Informaticae 34, 1-15. 

3. Droste, S., Jansen, T., and Wegener, I. (1998). A rigorous complexity analysis of 
the (1 + 1) evolutionary algorithm for linear functions with boolean inputs. ICEC 
’98, 499-504. 

4. Droste, S., Jansen, T., and Wegener, I. (2000). On the analysis of the (1 -I- 1) 
evolutionary algorithm. Submitted: Theoretical Computer Science. 

5. Fogel, D. B. (1995). Evolutionary Computation: Toward a New Philosophy of Ma- 
chine Intelligence. IEEE Press. 

6. Goldberg, D. E. (1989). Genetic Algorithms in Search, Optimization and Machine 
Learning. Addison Wesley. 

7. Hagerup, T. and Riib, C. (1989). A guided tour of Chernoff bounds. Information 
Processing Letters 33, 305-308. 

8. Holland, J. H. (1975). Adaption in Natural and Artificial Systems. Univ. of Michi- 
gan. 

9. Horn, J., Goldberg, D. E., and Deb, K. (1994). Long path problems. 3rd Int. Conf. 
on Parallel Problem Solving from Nature (PPSN), LNCS 866, 149-158. 

10. Jansen, T. and Wegener, I. (1999). On the analysis of evolutionary algorithms - a 
proof that crossover really can help. ESA ’99, LNCS 1643, 184-193. 

11. Jansen, T. and Wegener, I. (2000). On the choice of the mutation probability for 
the (1 -I- 1) EA. Submitted: PPSN 2000. 




10 



I. Wegener 



12. Jerrum, T. and Sorkin, G. B. (1998). The Metropolis algorithm for graph bisection. 
Discrete Applied Mathematics 82, 155-175. 

13. Motwani, R. and Raghavan, P. (1995). Randomized Algorithms. Cambridge Univ. 
Press. 

14. Miihlenbein, H. (1992). How genetic algorithms really work. I. Mutation and 
hillclimbing. 2nd Int. Conf. on Parallel Problem Solving from Nature (PPSN), 
15-25. 

15. Rabani, Y., Rabinovich, Y., and Sinclair, A. (1998). A computational view of 
population genetics. Random Structures and Algorithms 12, 314-334. 

16. Rabinovich, Y., Sinclair, A., and Widgerson, A. (1992). Quadratical dynamical 
systems. 33rd FOGS, 304-313. 

17. Rechenberg, I. (1994). Evolutionsstrategie ’94. Frommann-Holzboog, Stuttgart. 

18. Rudolph, G. (1997a). How mutations and selection solve long path problems in 
polynomial expected time. Evolutionary Computation 4, 195-205. 

19. Rudolph, G. (1997b). Convergence Properties of Evolutionary Algorithms. Ph.D. 
Thesis. Dr. Kovac, Hamburg. 

20. Schwefel, H.-P. (1995). Evolution and Optimum Seeking. Wiley. 
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We are given n points in the plane and a vertical line £ (directed upwards) 
that intersects the convex hull, convS', of S. We want to find ‘ways’ to the first 
edge (facet) of convS' met by line £. 

Here is a first process for getting to this first edge: Choose arbitrary points 
a and b from S to the left and right, resp., of £. Now take a random sample R of 
roughly ^/ri points in S. Determine the first edge ci of conv(i? U {a, 6}) (don’t 
worry how for the moment). If no point of S lies below the line through Ci, then 
ei is already the edge of convS' we were looking for. Otherwise, determine the set 
Vi of all points in S below ci and look for the first edge 62 of conv(y Ui?U{a, b}) 
met by £. Now, either there are no points below 62 (and we are done), or we 
determine the set V2 of points below and continue. How many such phases will 
we go through until we have found the first edge of convS? What is the expected 
size of Vi U V2 U . . .? It turns out that two phases suffice, and the expected size 
of Vi U V2 is 0{^/n). 

The second process starts with an edge Ci = ab connecting arbitrary points 
a and b as above. Either no point in S lies below the line through e\ (and we are 
done), or we choose a random point c below Either edge ac or edge be will 
be intersected by l\ let this edge be 62. Now either there is no point in S below 
62, or we sample a random point below, and so on. How long will this process 
take on the average? We show a bound of 0((logn)^), and there are point sets, 
for which this bound is actually achieved. 

It has been known since the early eighties that the problem of computing 
the first edge of convS' met by i can be solved optimally in linear time by 
linear programming [2,5]. Our motivation for considering the above processes 
is actually motivated by the higher-dimensional counterparts. The first process, 
analyzed in [4], describes what is known as the multiple pricing heuristics in 
linear programming, and it is a simplified variant of an algorithm previously 
analyzed by Clarkson [1]. The second process (see [3]) describes the so-called 
random edge pivot rule for the simplex method. 
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Abstract. We study the problem of routing and scheduling requests 
of limited durations in an all-optical network. The task is servicing the 
requests, assigning each of them a route from source to destination, a 
starting time and a wavelength, with restrictions on the number of avai- 
lable wavelengths. The goal is minimizing the overall time needed to 
serve all requests. We study the relationship between this problem and 
minimum path coloring and we show how to exploit known results on 
path coloring to derive approximation scheduling algorithms for mes- 
hes, trees and nearly-Eulerian, uniformly high-diameter graphs. Inde- 
pendently from the relationship with path coloring we propose different 
approximation algorithms for call scheduling in trees and in trees of rings. 
As a side result, we present a constant approximation algorithm for star 
networks. We assume for simplicity that all calls are released at time 0, 
however all our results hold also for arbitrary release dates at the expense 
of a factor 2 in the approximation ratio. 



1 Introduction 

Scheduling a set of communication requests (calls in the following) in an all- 
optical network requires to assign a route, a wavelength chosen from a set of 
limited size and a starting time to each call, with the constraint that no pair of 
calls using the same wavelength may simultaneously use the same link. 

All-optical networks allow very high transmission rates due to both Wave- 
length Division Multiplexing (WDM) and the avoidance of opto-electronic con- 
versions. WDM allows the concurrent transmission of multiple data streams on 
the same optic fiber, subject to the constraint that different data streams use 
different wavelengths when using the same optical link at the same time and 
in the same direction [5,11,17]. The avoidance of opto-electronic conversions is 
guaranteed by optical switches, directly acting on the optical signal. The optical 
switches we consider in this paper can direct an incoming data stream on any of 
the available wavelengths to any of their outgoing links on the same wavelength. 

In this paper, we consider calls of limited durations (different calls may have 
different durations) and we address the problem of scheduling calls in order to 
minimize the overall time required to serve all of them. This problem is known 
as MINIMUM CALL SCHEDULING [4]. 



U. Brandes and D. Wagner (Eds.): WG 2000, LNCS 1928, pp. 13-22, 2000. 
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Related work. Scheduling a set of communication requests in a network in or- 
der to achieve minimum completion time is a widely studied problem in several 
network models [16,15,12,7,8,9]. Concerning all-optical networks, so far the pro- 
blem has been mostly considered neglecting call durations (i.e. considering them 
infinite) . A popular problem in the literature is minimizing the number of wave- 
lengths used to serve a set of connections having infinite durations. This problem 
is often referred to as minimum path coloring and has been widely studied. It 
has been considered in the off-line setting in trees [19,10], in mesh networks [18], 
in uniformly, high-diameter nearly-Eulerian graphs [14]. In the on-line setting, it 
has been studied in [2,3]. In [1] the authors propose algorithms to minimize the 
number of rounds necessary to schedule a set of calls in a mesh. They also show 
that any permutation can be routed in one round if the number of wavelengths is 
0(log^ m/ fP), where (3 is the edge-expansion of the network and m is the num- 
ber of vertices. Call scheduling was mainly considered in the non optical case. 
Erlebach and Jansen study the problem in stars, trees, meshes and rings, both 
in the on-line and the off-line case [7,8,9]. In particular, they propose constant 
approximation algorithms for stars and 0(logm)-approximation algorithms for 
trees (where m is the number of nodes) . The scheduling of calls of limited dura- 
tions in all-optical networks has been considered in [4] for chains and rings. The 
authors propose constant approximation algorithms for both chains and rings. 

Results of the paper. In this paper, we consider both the case in which the graph 
is directed and a call is an ordered pair of vertices requiring a directed path, 
and the similarly defined undirected case. The former model is more realistic 
[11], the latter is more frequent in the literature about minimum path colo- 
ring. In Section 3 we show how to transform a solution for the one wavelength 
case into one for the k wavelengths case, and then we study the relationship 
between minimum call-scheduling and minimum path coloring. If m and 
A denote, respectively, the number of vertices in the graph and the ratio bet- 
ween the durations of the longest and the shortest call, then the above results 
and those of [18,10,19,14] allow to obtain the following for minimum CALL- 
SCHEDULING: an 0(logZ\ • poly(loglogm)) approximation algorithm for meshes 
in the undirected case, an 0(logZ\ • log m)-approximation algorithm in undirec- 
ted, nearly-Eulerian, uniformly high-diameter graphs and 0(logZ\) approxima- 
tion algorithms for trees, both in the directed and in the undirected case. On 
the other side, the technique we propose is general and can alwayse be used to 
obtain results for minimum call scheduling from bounds for minimum path 
COLORING. 

In Sections 4 we provide an alternative approximation algorithm for directed 
trees that is independent from the results on minimum path coloring. More in 
detail, we provide an approximation preserving reduction that allows to extend 
the approximation results of [8,9] for Call Scheduling in non optical stars to 
MINIMUM CALL SCHEDULING in directed stars; we also use this result to derive 
a O(logm) approximation algorithm for directed trees. 

In Section 5 trees of rings are considered. In particular, a constant appro- 
ximation algorithm is also presented for the case of chains of rings. We remark 
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that trees of rings have practical relevance, since they are realistic topologies for 
LAN interconnection networks [19]. 

For the sake of simplicity, in this paper we assume that all calls are released 
at time 0. However all results can be extended to the case of arbitrary release 
dates at the expense of a factor 2 in the approximation ratio, using a well known 
result of [20]. 

2 Definitions 

As far as directed graphs are concerned, in the following G = (V, A) denotes a 
simple, directed graph on the vertex set V = {vq, vi, . . . , Vm-i} such that arc 
(vi, Vj) exists if and only if arc (vj,Vi) exists. A network is a pair (G, k) where k 
is the number of available wavelengths. Since a wavelength may also be viewed 
as a color, in the sequel we use the term color instead of wavelength. A call 
C = [(s,d),l] is an ordered pair of vertices s, d completed by an integer I > 0 
representing the call duration. More precisely, s is the source vertex originating 
the data stream to be sent to the destination d. 

Similarly, in the undirected case G = (V, E) denotes a simple, undirected 
graph on the vertex set V = {vq,vi, . . . and on the edge set E and 

a network is a pair (G, k) where k is the number of available colors. A call 
G = [{s,d},^] is now an unordered pair of vertices s,d completed by an integer 
I > 0 representing the call duration. In this case, s and d represent a pair of 
vertices between which a communication has to be established. 

Given a directed (undirected) network (G, k) and a set C of calls, a routing is 
a function R : C ^ 'P(G), where V{G) is the set of simple paths in G such that, 
for any G gC, R{C) is a path connecting the source of G to its destination (its 
end vertices in the undirected case). Given a routing i?, we say that Gi, G 2 € C 
conflict in R if i?(Gi) fl R{G 2 ) yf 0. 

Given a directed (undirected) network {G,k), a set C of calls and a routing 
R for calls in G, a schedule is an assignment of starting times and colors to calls 
such that at every instant no pair of calls with the same color use the same arc 
(edge). More formally, a schedule for a set C = {Ch '■ h = 1, . . . ,n} of calls is 
a pair {S, E), where S : C — >■ [0, 1, . . . is a function assigning starting 

times to calls and F : C — >■ x {!,... A:} is a function assigning colors to calls. 
In particular, each call Ch has to be scheduled for a contiguous interval of Ih 
time units starting at S{Ch), i.e. no preemption is allowed. S and F must be 
such that, if T = maxi<^<fc{S'(Gft,) -I- Ih} then, for any 0 < t < T and for any 
(u,v) G A ({u, w} £ E), no pair of calls with the same color uses (u,v) ({m, v}) 
at t. We call T the makespan of schedule S. 

In this paper we consider minimum CALL-SCHEDULING (in short, min-CS) 
defined as follows: given an directed (undirected) network (G, k) and a set C = 
{Ch ■ h = 1, . . . , n} of calls, find a routing and a schedule for C having minimum 
makespan. For the sake of brevity, we sometimes write tree-MiN-CS, ring-MiN-CS 
and so forth to denote the problem in a particular class of networks. Whether 
we are considering the directed or the undirected case will be clear from context. 
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In the special case of one available color, min-CS will be shortly denoted as min- 
CSi- In the sequel we assume that k < \C\, since otherwise the problem is trivial. 
Henceforth, an instance of the problem will be completely described by the pair 
{{G,k),C). For ease of notation, an instance of min-CSi will also be denoted by 
(G,C). 

Given an instance of min-CS and a routing R, the load Lji{e) of edge (arc) 
e is defined as the sum of the durations of calls that traverse e. Notice that, if 
L* = minfl(maxe L(e)), then L* /k is a natural lower bound to the makespan of 
any schedule for a given instance. 

In the remainder of this paper A will always denote the ratio between the 
durations of the longest and of the shortest calls in C and OPT the makespan 
of an optimum schedule for an instance {{G, k),C) of min-CS. 



3 From MINIMUM PATH COLORING to MIN-CS 

We present two theorems that allow the reduction of min-CS to minimum path 
COLORING. Notice that the results hold both in the undirected and the directed 
case. 

MINIMUM PATH COLORING has already been informally described in the in- 
troduction. It is now formally defined as follows: given an undirected graph 
G = (V, E) and a set T of source-destination pairs (s/i, du) in G, h = 1, . . . , n, 
find a set P of paths and a coloring for paths in V, such that no pair of paths 
with the same color share the same edge and the overall number of colors is 
minimized, minimum path coloring may be similarly defined in the directed 
case. Notice that, if calls have unit durations, then min-CSi in undirected net- 
works is isomorphic to minimum path coloring. The same observation holds 
in the case of directed networks. 

Theorem 1. If minimum path coloring is a-approximable then min-CSi is 
2a [log Z\] approximahle. 

Proof. In order to apply the a-approximation algorithm for minimum path co- 
loring to (G,C), we partition the set C into [logZ\] subsets Ci, . . . ,C\^\ogA\ as 
follows. To this aim, we first round up the duration of each call to the next 
power of 2 and we normalize to 1 the shortest duration. Then, we assign calls of 
durations 2*“^ to G, i = 1, . . . , [log Z\] . In this way we obtain [log Z\] instances 
(G^Pi), i = 1, . . . , [logZ\], of MINIMUM PATH COLORING, where Pi is obtained 
from Ci in the obvious way. Solving (G, Pi), i = 1, . . . , [log Z\] , we obtain a rou- 
ting for calls in Ci. A schedule for calls in Ci can be inductively derived. Suppose 
a schedule has been derived for calls in Ci U . . . UCi_i and let Ti_i be its comple- 
tion time; the schedule for calls in Ci is built as follows: if the path corresponding 
to a call G in Ci has received color j in the solution of (G,Pi), then G starts at 
time Ti_i + {j — 1)2*“^. The resulting schedule has length at most 2a [log Z\] the 
length of the optimum schedule, since a factor at most 2 is lost when rounding 
call durations. □ 
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Theorem 2. If there exists a polynomial-time a -approximation algorithm for 
MiN-CSi, there also exists a polynomial-time 2a -approximation algorithm for 
MIN-CS. 

Proof. Let {{G,k),C) be an instance of min-CS and let (G,C) be the correspon- 
ding instance of min-CSi. Denote by S and T, respectively, an a-approximation 
schedule for {G,C) and its completion time. Given S, the following algorithm 
produces a schedule Sk for {{G,k),C): 

Algorithm from-PtO-k-Colors 

input: G,C, k, a schedule S for 1 wavelength; 

output: a schedule {Sk, F) for G,C, fc; 

begin 

for j := 1 to fc do 
begin 

Vj~{G:GgCA S(G) G [{j - l)T/k,jT/k)}; 

Assign color j to calls in T>j: F{G) = j for each G G Dj; 
for any call G G T>j do 

Sk{G) ~ S{G) - {j - l)T/fc; 

end 

Output (Sk,F)-, 

end. 



Clearly, since k < \C\, algorithm from-lGo-k-Colors runs in polynomial time. 

After the execution of algorithm from-lGoJt-Colors we are left with a feasible 
schedule Sk for min-CS. In fact, each call in C has been assigned a color and a 
starting time. Furthermore, for each j, the relative delays between calls in T>j 
have not changed with respect to S, hence Sk is feasible. 

Let us now derive the completion time of Sk- For any wavelength j, let Tj 
be the completion time of calls that are assigned color j. Finally, let OPTi and 
OPTk respectively denote the optimal values for (G,C) and {{G,k),C). For any 
j, let = max{l : G GVj}. We have: 



JT {j - l)T 
k k 



+ li 



T 

< - -gP 

max _ 7 I ‘'max* 

k 



Two cases may arise: 

1- ^max < f-- In this case Tj <2^ < 2a < 2aOPTk- The last inequality 
is proved by contradiction: if > OPTk, we could use an optimal al- 

gorithm for MIN-CS to schedule instances of min-CSi and the corresponding 
completion time should be less than OPTi). 

2. Since OPTk > ^^ax> then OPTk > ^ and therefore Tj < 20PTk- 



This completes the proof. 



□ 



Corollary 1. There exist polynomial-time approximation algorithms for min 
CS in undirected meshes, directed and undirected trees and undirected nearly- 
Eulerian, uniformly high-diameter graphs with approximation ratio i) 0(logZ\ • 
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poly(loglogm)) for undirected mesh networks; ii) 0(4.4 log Z\) for undirected 
trees and (20/3) log Z\ for directed trees; Hi) 0(logZ\ • logm) for undirected, 
nearly -Eulerian uniformly high-diameter networks. 

Proof. Consider an instance {{G,k),C) of MIN-CS. 

If G is an undirected mesh, we first solve min-CSi by applying Theorem 
1 and exploiting the polynomial-time 0(poly (log logm) (-approximation algo- 
rithm for PATH COLORING in the mesh proposed in [18]. This yields an 0(logZ\- 
poly(log logm) (-approximated solution for {{G,k),C). By applying Theorem 2 
the thesis follows. 

The proof for trees is similar to the previous case. For undirected trees we use 
a result in [7], improving that of Raghavan and Upfal [19], yielding a 1.10PT-\-0.8 
approximated algorithm for minimum path coloring in undirected trees. Then, 
we apply Theorems 1 and 2 to derive an 0(4.4 log A) approximated schedule for 
MIN-CS. In the directed case we use a result of Erlebach et al. [10] achieving with 
the same technique an approximation ratio of (20/3) log Z\ for min-CS. 

Nearly-Eulerian, uniformly high-diameter graphs are planar graphs that ex- 
tend the mesh, minimum path coloring in these graphs was studied in [14], 
where an 0(log m)-approximation algorithm was presented. Proceeding the same 
way as in the previous cases we prove the last part of the Corollary. □ 

As remarked in the introduction, the results presented in this and the next 
sections can be extended to the case in which each call has a release date at the 
expense of a factor 2 in the approximation ratio, using a result of [20]. 

4 Call Scheduling in Directed Trees 

A directed star Em is a directed graph with vertex set {u,v\, . . . ,Vm} and arc 
set {{u,Vi), (vi,u) : i = l,...,m}. An instance of the problem is now a pair 
{Em,C), where C is a set of directed calls. 

We now show an approximation preserving reduction of call scheduling from 
directed star networks to undirected stars. 

Theorem 3. Given an instance Id of directed star-MiN-CS there exists an in- 
stance lu of undirected star-MiN-CS such that there exists a one-to-one corre- 
spondence between feasible solutions of Id and and the makespan of corre- 
sponding solutions is the same. 

Proof. Let Id consist of a directed star Em and a set of calls C in Em. The 
corresponding instance = (EfmiC^) is derived as follows: 

1. each vertex v in Em is replaced in by the pair Vi,Vo of vertices; 

2. each call G = (s, t,d) G C is replaced in by a new call C“ = (sj, to, d). 

Notice that two calls in C conflict in Em if and only if the corresponding two 
calls in C“ conflict in Elf^. Consider a feasible solution Sd of Id with makespan T; 
the corresponding schedule S'„ for is obtained as follows: if call G is scheduled 
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at time t in Sd, the corresponding call C“ is scheduled at time t in Su- Hence 
the makespan of S'„ is T as well. 

The proof of the converse is analogous. □ 

The theorem above yields an algorithm for star-MiN-CSi in the directed case as 
stated in the following 

Corollary 2. There is an algorithm for stor-min-csi in the directed case that 
produces a schedule whose completion time is at most 2 times the optimum. 

Proof. Use the result of [9] for Call Scheduling in non-optical star networks and 
apply Theorem 3. □ 

The generalization to the case of k colors easily follows from theorem 2, as 
stated in the next corollary. 

Corollary 3. There is an algorithm for stor-min-cs in the directed case that 
produces a schedule whose completion time is at most 4 times the optimum. 

In the case of tree networks we use the following well known separator theo- 
rem. 

Theorem 4. [6,21] Let T he an m-vertex tree. There exists at least one vertex 
(called a separator or median) whose removal splits the tree into subtrees, each 
containing at most mj2 vertices. 

Theorem 5. There exists a polynomial-time algorithm that finds a schedule for 
the instance (T,C,k) of tree-MiN-CS in the directed case and whose completion 
time is at most 

20 

min{41ogm, log A} • OPT. 

Proof. We first solve tree-MiN-CSi. To this purpose, if the minimum value is 
4 log TO, we use the following algorithm: 

1. choose a median vq in T. Assign vq level 0. Let Ti,. . . ,Th be the subtrees of 
T obtained by removing vq. Schedule calls that traverse uq using the min-CS 
algorithm for directed stars. 

2. For i = 1, . . . ,h, recursively apply the above procedure to each subtree Ti 
and to the corresponding calls. At the j-th step of the recursion, each new 
median is assigned level j. The procedure terminates when each vertex in T 
has been assigned a level. 

The above procedure assigns a level I to each vertex in the tree, with 0 < 
I < log TO — 1. This way, each call may also be given a level, uniquely defined as 
that of the lowest level vertex the call uses. This induces a partition of the set 
of calls into logTO sets Co, . . . ,Ciogm-i- Notice that: 

i) : all calls of the same level using the same median can be scheduled using 
the MIN-CS algorithm for directed stars; 

ii) : all calls of the same level, but using different medians do not share any 
edge along their routes and can therefore be scheduled independently and in 
parallel. 
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Calls are scheduled in phases. In particular, during phase j calls in Cj {j = 
0, . . . ,logm — 1) are scheduled. More in detail, for each median v of level j an 
instance {S{v),C{v)) of star-MiN-CSi is solved, in which U(v) is the star in T 
with central vertex v and C{v) is the set containing all calls in Cj that traverse 
V. Notice that, if z < j, then all calls of level z are terminated before the first 
call of level j is scheduled. The considerations above and Corollary 3 imply that 
the partial schedule of each phase has makespan at most AOPT. Since there are 
log m phases the overall schedule has makespan at most 4 log m ■ OPT. 

Instead, if the minimum value is (20/3) logZ\ we apply Corollary 1. □ 



5 Trees and Chains of Rings 

A Tree of rings [19] can be obtained from a tree by replacing each vertex of the 
tree by a ring under the following constraints: i) rings corresponding to non- 
adjacent vertices of the tree share no vertex; ii) rings corresponding to adjacent 
vertices of the tree share exactly one vertex. 

To solve the directed case we use results of [4]. The undirected case can be 
solved in the same way using results of [13,4]. The proofs will be given in the full 
version of the paper, here we only present our results and try to give a flavour of 
the techniques used. Let TTZ denote a tree of rings. We solve the general problem 
of call scheduling in trees of rings (both in the directed and the undirected case) 
in two steps: 

1. Perform a routing of calls with minimum load. 

2. Schedule calls using the routing obtained in step 1. 

The main issue in step 1 is routing in the ring. This problem has been solved 
in both the directed and in the undirected case [13,4]. In both cases polynomial 
time approximation algorithms have been proposed. Let L* denote the load of a 
minimum load routing for {{'TTZ, k),C). Of course L* and L* /k are lower bound 
to the optimum makespans of min-CSi and min-CS respectively. 

In step 2 we schedule calls routed in step 1 proceeding similarly as in the 
case of trees, the main difference being that in each phase we solve instances of 
MIN-CS in ring networks. In conclusion, we prove the following theorem: 

Theorem 6. There exists a polynomial algorithm to solve dir-min-CS in trees 
of rings with approximation ratio O(logr), r being the number of rings in the 
networks. 

We observe that while obtaining a constant approximation ratio is still open, 
not surprisingly the problem is 0(l)-approximable in the special case of chains 
of rings, i.e. when the underlying topology is a chain. In this case we use a 
different algorithm, a major step of which is reducing the instance {{'TTZ,k),C) 
to a proper instance of chain-MiN-CS in a non trivial way and then using the 
results of [4] (directed case) or [13] (undirected case). This allows us to claim: 

Theorem 7. There exist approximation algorithms that solve min-CS in chains 
of rings with approximation ratio (22 -|- e), for any e > 0. 
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The constant is high, the main contribution deriving from the scheduling of 
calls that are internal to the single rings of the network. In [4] it is shown that 
obtaining a constant lower than 11 for ring-MiN-CS may be a non trivial task. 
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Abstract. The communication overhead is a major bottleneck for the 
execution of a process graph on a parallel computer system. In the case of 
two processors, the minimization of the communication can be modeled 
by the graph bisection problem. The spectral lower bound of ^2^ for the 
bisection width of a graph is well-known. The bisection width is equal 
to iff all vertices are incident to ^ cut edges in every optimal 

bisection. We discuss the case for which this fact is not satisfied and 
present a new method to get tighter lower bounds on the bisection width. 

This method makes use of the level structure defined by the bisection. 

Under certain conditions we get a lower bound depending on Aj |U| with 
I < < 1. We also present examples of graphs for which our new 

bounds are tight up to a constant factor. As a by-product, we derive 
new lower bounds for the bisection widths of 3- and 4-regular graphs. 

We use them to establish tighter lower bounds for the bisection width of 
3- and 4-regular Ramanujan graphs. 

1 Introduction 

The relation between the second smallest eigenvalue of the Laplacian and the 
bisection width of a graph has already been discussed in the literature. A bisec- 
tion of a graph is an equal distribution of the vertices of a graph into two parts. 
The cut size is the number of crossing edges between the two parts. The bisec- 
tion width is the minimal cut size of all bisections. Its calculation for arbitrary 
graphs is JVP-complete [8], even when it is restricted to the class of d-regular 
graphs [3]. The bisection of a graph and, more generally, the partitioning of a 
graph into several balanced parts have a wide range of applications, especially 
in the field of parallel computation. 

Many different methods of calculating partitions of a graph have been pro- 
posed in the past by scientists from different fields like mathematics, computer 
science or engineering. There are also many software libraries available which 
include the most efficient graph partitioning approaches. Most heuristics have 
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European Union ESPRIT LTR Project 20244 (ALGOM-IT). 



U. Brandes and D. Wagner (Eds.): WG 2000, LNCS 1928, pp. 23—34, 2000. 
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been used widely on examples in many applications and the partition qualities 
are satisfying. 

This is not the case for lower bounds on the bisection width and there are 
only few approaches that are known. Leighton [11] proposes a lower bound of the 
bisection width by calculating a routing scheme between all pairs of vertices such 
that the congestion is minimized. This lower bound is strict for some graphs such 
as complete graphs, hypercubes, grids or tori. However, for irregularly structured 
graph, as they appear in most applications, remains the problem to calculate a 
routing scheme with a low congestion. 

Lower bounds on the bisection width can be derived from algebraic graph 
theory by relating the bisection problem to an eigenvalue problem. The well 
known lower bound on the bisection width bw of a graph G = (V, E) with 
n = |V"| and second smallest eigenvalue A 2 (G) of the Laplacian is 



hw > 



\ 2 {G)-n 

4 



( 1 ) 



In theorem 1 we show that the equality holds for (1) iff all vertices are incident 
to exactly ^ cut edges. This is the case for some graphs, e.g. complete graphs, 
complete bipartite graphs, hypercubes or the Petersen graph. Nevertheless, there 
is a large gap between this lower bound and the bisection width for most graphs. 
To give an example, crash or flow simulations can be solved by the Finite Element 
Method. The mesh used to discretized the object often has a structure similar 
to a grid. The second smallest eigenvalue of a x grid is A 2 = 2 — 2 • 
cos( 7 r/\/n) « 7 T^/n. Obviously, the bisection width is -Jn. However, the lower 
bound of equation (1) does only result in a value of Here, a relation of bw 
and -\/A 2 would give a tight bound. 

Such a quadratical gap also appears in the relation of the isoperimetric num- 
ber and A 2 [4,6,14]. The isoperimetric number of a graph G = (V, E) is defined as 

i{G) = min 5 ^y_o<|S|<f where E{S, S) is the set of edges which connect 

vertices of S with vertices of S. In [6,14] it is shown that 



\G) 



2 • 



.(G) 



< A2(G) < 20(G) 



with maximal degree dmax{G). Additionally, it is A 2 = O(^) for bounded-degree 
planar graphs and two-dimensional meshes and A 2 = 0 (;,^) for well-shaped d- 
dimensional meshes [18]. 

If the lower bound of equation (1) is not strict, either all vertices are still 
incident to cut edges but not to exactly ^ of them, or there are vertices which 
are not incident to any cut edge. We will present a new method to determine an 
upper bound for A 2 . This method makes use of the level structure of a bisection 
in which every level consists of all vertices which have the same distance to the 
cut. This will lead to a generalized lower bound of equation (1) depending on the 
growth of the sizes of these levels on each side of the cut. Let 5 : IN — >■ IN be a 
function. We will introduce the class of Level Structured graphs LS{g, a) which 
have a bisection with a cut size of cr and a level structure such that there are not 
more than ag{i) edges between vertices of distance i to the cut and vertices of 
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distance i + 1 to the cut. We show that if the sum 1 + 2 ^ constant 

A, then for - — >■ oo it is 

' <7 



o>A^^^{l + o{l)). ( 2 ) 

Thus, we established a relation of a and X^n in the range \ < /? < 1- For 
equation (2) we show that there are graphs from LS{g,a) for which the bound 
is tight The techniques used in this proof might be of special interest for some 
readers since we develop new methods computing asymptotic exact the second 
smallest eigenvalue of the laplacian for certain weighted paths. 

We get an improved relation of the bisection width and A 2 if the growth of the 
level sizes defined by an optimal bisection is bounded by some function g{i). In 
general, the level structure of an optimal bisection is not known, but there is one 
notable exception. If G is a graph of maximum degree d, then G G LS{g, bw{G)) 
with g{i) = {d — 1)*. Thus, for — >■ 00 , equation (2) results in a lower bound 

We expect that there exist further classes of graphs for which equation (2) can 
be used directly to derive a lower bound on the bisection width. 

As a by-product, we get better lower bounds on the bisection width of 3- and 
4-regular graphs. We use them to establish tighter lower bounds for the bisection 
width of 3- and 4-regular Ramanujan graphs. Ramanujan graphs of degree d have 
A 2 > d — 2\/d — 1 and, thus, for a fixed d it holds bw = 0(n). There are several 
methods of explicitly constructing Ramanujan graphs [5,12,13,15]. Equation (1) 
leads to lower bounds of 0.043n and 0.134n for the bisection widths of 3- and 
4-regular Ramanujan graphs. With the use of the level structure of a bisection 
we can prove that any 3-regular Ramanujan graph has a bisection width of at 
least 0.082n and any 4-regular Ramanujan graph a bisection width of at least 
0.176n. These are the highest lower bounds for explicitly constructible 3- and 
4-regular graphs. However, there are 3- and 4-regular graphs with a higher lower 
bound on the bisection width. Kostochka and Melnikov [10] have shown that the 
bisection width is at least « O.lOln for almost all 3-regular random graphs 

and Bollobas [2] has shown that the bisection width is at least = 0.22n for 
almost all 4-regular random graphs. 

Section 2 presents the basic relation of A 2 and the bisection width. In section 3 
we show a new relation between A 2 and the cut size cr of a bisection. In section 4 
we demonstrate that there are graphs for which the new bounds are tight up to 
a constant. 

2 Definitions and Background 

A bisection tt of an undirected graph G = (V,E) with ]Rj = n is a partition 
V = Vo U hi such that [Vq] = [Vi] = n/2. In the following we assume that n 
is even. For a bisection tt, the number of edges between Vq and V^i is called the 
cut size of 7T. The minimum cut size of a bisection is called the bisection width 
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bw{G) of G. For a graph G, the nx n Laplacian matrix L{G) = {ly,w} is defined 
as 

i deg(w), if u = w 

— 1, it V ^ w and {w,w} G E 
0, otherwise 

It is known that all the eigenvalues of L(G) are nonnegative. We denote them by 
Al, . . . , A„ with 0 = Al < A 2 < ... < A„ and pairwise perpendicular eigenvectors 
yi,y 2 , ■■■Vn- The eigenvector to the eigenvalue 0 is yi = ;^(1, 1, 1, ...1) and its 
multiplicity is equal to the number of connected components in the graph (see 
e.g. [17]). We consider connected graphs only, so A 2 > 0. Let x = {xi,X 2 , ...,x„) 
be a non-zero vector. It follows from the Courant-Fisher principle that 



A2(G) 



. f x^Lx 1 
I |T1T¥ r 

x±l ( \\x\\-‘ J 



= mm • 

cc_Ll 






(3) 



Furthermore, the minimum in (3) is attained iff x is an eigenvector to A 2 . Using 
the Lagrange identity, (3) can be rewritten (cf. [7]) as 



A2(G) 



min 

lET^const 






(4) 



Here the minimum runs over all vectors not collinear to (1,1,...,!). Now a 
simple lower bound on bw{G) can be derived by applying (4) to the x-tuple 
defined by 



Xv 



a, if u e Vo , , 

, .„ with some a ^ b. 

0 , if u G Ui 



(5) 



This leads to A 2 < n ^1,2 = j 0 ^ the well-known lower bound of 

bw > which is strict for some classes of graphs. In the following theorem 

we completely specify the situation when this bound is attainable. 



Theorem 1. Let G = (U, E) be a graph and A 2 be the second smallest eigenvalue 
of L{G). Then the following statements are equivalent: 

a. bw{G) = 

b. there is an eigenvector corresponding to X 2 which has only —1 and -1-1 
entries; 

c. in any optimal bisection V = Vq U Vi any vertex is incident with ^ cut 
edges. 



Proof. We prove that statement (a) implies (b), (b) implies (c), and (c) implies 
(a). 



Assume bw{G) = and let U = Vq U Ui be an optimal bisection. Consider 
the vector {xi , . . . , with = 1 for m G Vo> Etnd a;„ = — 1 for m G Vi. Now (4) 

= A 2 . Therefore, x is an eigenvector 



implies A 2 < ^ 



^ ^ Abw(G) _ 



to A 2 . 
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Now assume there is an eigenvector x corresponding to A 2 specified above. 
Let u G Vo (a similar argument works well for u € Vi), and let a„ (resp. 6„) 
denote the number of vertices of Vq (resp. V\) incident to u. Then the uth entry 
of L{G)x equals deg(M) — a„ + Since L{G)x = X 2 X and the uth entry of x is 
1, then deg(u) — a„ + 6^ = A 2 . This equality along with = deg(u) implies 

h — X2. 

Finally, let V = VbUVi be an optimal bisection. Assume any vertex is incident 
with exactly ^ cut edges. Since |Vol = I^Al = n/2, then the size of the cut equals 
^ On the other hand the size of the cut equals bw{G) because the bisection 
is optimal. □ 

Therefore, there are plenty of graphs for which bw{G) = holds. Exa- 
ples are the complete graphs, the complete bipartite graphs, hypercubes or the 
Petersen graph. 



3 New Relations between A 2 and cr Derived by the Level 
Structure 

As it is shown in theorem 1, the lower bound (1) is tight only if any vertex of 
G is incident to a cut edge. We consider the case where this condition is not 
satisfied, and show that for such graphs this lower bound can be significantly 
improved. 

Definition 1 (Level Structure). Let tt be a bisection of a graph G = {V,E) 
with V = Vb U Vi and cut size a. Denote the subsets Vq o/Vq os follows. Let Vf 
be the set of all vertices in Vq which are incident to a cut edge. Let Vq be the 
sets of all vertices in Vq with distance i — 1 to a vertex of Vq. Denote with Vf 
the according sets on side V\. Furthermore, denote with Eq, i> I, the edge sets 
which connect vertices between sets Vq and Vq~^^ and with E\ the same in part 
Pi. 

Let g : IN — >■ IN be a function. We denote with LS{g,a) the class of graphs 
which have a bisection tt with a cut size a and a level structure such that \Eq\ < 
(jg{i) and \E\ \ < ag{i) for all i > 1. 

In lemma 1 we bound A 2 from above by some expression depending only on 
the grow function g{i). Indeed, the proof of the lemma shows that the worst case 
occurs if \Vi\ = g{i — 1) holds for any i. This result will be used in theorem 2. 



/-I 

Lemma 1. Let G € LS{g,a). Let ? G IN &e a number such that n > 2cr ^ ff(*~ 

i=l 



I). Then 



\2{G) < min 

l = ai<a2<---<a.^ 



2o-gf + a 9{i){ai ~ m+if 

^ o?5(* - 1) + a? (f - o- 9{i ~ 1)) 
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Proof. We need a non-zero vector x with x_Ll for equation (3), whereas we do 
only need a non-constant x for (4). Consider (4) and choose a vector x with 
entries Xy for v G V defined by 



/ ai, if u € Vf 
\ -ai, if u € W 



(6) 



with 1 = Oi < 02 < • • • < o;. The upper bound of equation (4) now only 
depends on the afs. Denote by A(x) := ~ numerator of 

(4). One has 

A(x)=4aai + y^ (|£'J|-|-|-Bl|)(oi-ai+i)^ < 4craf-|-2cry^ g(i)(ai~ai+i)^ . (7) 

Now we estimate the denominator B{x) := of (4). It holds 

\Vd\ < l^o~^l ^ ~ !)• Assume now that \Vd\ < ag{i — 1) for some j, 

1 < j < /. Denote by x the corresponding vector for the level structure obtained 
by moving a vertex from to Vff . One has 



B{x) — B{x) 

i-i i i 

= - - ajf\Vd\ - Al - ajf + ^((Oi -|- ajf - (ai + aif)\Vl\ 

i=l i=l i = l 



= ^ ^((a« - ajf - (ai - aif - (ai - ajf)\Vd\ - (ai - ajf 

+ 7 ^ ^((a« + o.jf - Ai + aif + Al - 

= 2(ai - aj)(ai - ai)\Vg\ - (ai - aj)^ + 2(ai + aj)(aj - ai)\V^ \ < i 



To establish the first equality we used |Io*| = 1^*1 ~ Si=i 1^*1) while the 

inequality follows from ai > ai for 0 < i < 1. Therefore, B(x) does not increase 
because of this transformation and the minimum of B{x) (for a fixed V\ part) is 
attained if \Vq \ = ag{i — 1 ) for 1 < f < L 

A similar argument provides \Vf \ = ag{i — 1) for 1 < f < Hs sufficient for 
B{x) to attain its minimum. Therefore, in this case \Vq\ = \Vf\ for 1 < i < L 
Thus, a;Tl and the denominators of (3) and (4) (multiplied with are equal. 
These arguments imply 






/ Xv'^ > 2cry^ 



afg{i -l) + 2af (J^- a ^ s(* - 1)) • 



(8) 



The lemma follows by substituting (7) and ( 8 ) into (3). 



□ 



Theorem 2. Let G G LS{g,a). Then there exists a function 7 : IR"*' — >■ IR with 
'y(x) -G 0 for a; — >■ 00 such that 

- if A-.= 1 + 2YT=2 ^(T=r^ < 00 , then a > -k 7 (y)). 

“ */ ff(*) = (* + 1 )> then a > LambertW{j^^)^^^^^^{l + ^{^)), where 
LambertW {x) is the inverse function of x ■ e^ . 
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“ */5(*) = (* + 1)“ md 0 < Of < 1, then a > 6 {a)\ 2 {G) 2 n(l + 7(^)) where 
6 (a) = ^ — s+n- 

2((l-a)(3-a)) — 

Proof. Let I G IN be defined by — 1) < n < ~ !)• 

This, in particular, implies - — >■ c» as / — >■ 00 . We apply lemma 1 with = 

1 Since 2a? + ^^(ai - a*+i)^ = 2 + 4^Y.\=\ ^ = 2a; 

then 



A 2 ^ 



2aai 



2 ai 






(9) 



In the following we use some functions 7 ^ with the property described in the 
theorem. 

• A := 1 + 2 ^ Set ri := A — Qi, 1 < i < I, with > 0. The 

first inequality of equation (9) implies 



A 2 ^ 



2cr(A - n) 



■ - 22lri + rf)g{i - 1) + {A^ - 2An + rf) (^f - cr g{i - 1)^ 

2a{A — n) 

cr^^(l + 7i(y))Elli5(*- 1) +^^(1 + 72(y)) (f “ a E!Ii “ 1)) 

4a 



- T.n(l + 7 ( = )) • 

• gi'i) = {i+ 1): Note that ln(n + 1) = ^dx < Er=i y < 1 + /" = 

1 + ln(n). Therefore, the second inequality in (9) with g(i) = i + 1 provides 

2ai ^ 41n(/) + 2 



A 2 ^ 



< 



l + EL2*(21n(* + l)-l)^ 



< 



41n(0 + 2 



< 



'-1 - Z2inm(l +7i(ii)) ■ 

1 + J (4a;ln^(a: + 1) — 4xln(a: + 1) + ®)da; 

1 

It follows from n < 2 a Ei=i * = 2cr^(l + 72 (y)) that I > y/y(l + 72 (y))- This 
leads to A 2 < ^ (1 + 72 (^))- Solving this equation in a we get the result 

nln(wn/cr) 

of the theorem. 

• g{i) = (i + 1)“ and 0 < a < 1: Since E;=i T ®(i^)) ^ 



it is 

A2<^^< 



ji — Q 

It 

1 — a 



0+1 ' 



< 



(1 — a)(3 — a) 

— l-l — l-l l-l — p(l I ' 

E*“a? E*“4(iE^)2(l+7i(y)) E + 71 (g)) 

i=l i=\ 2 = 1 

From n < 2 ctELi*“ = 2cr^(l + 73 (y)) it follows that I > 



This leads to 

(l-a)(3-a) 



A 2 



(=4g^)^ ( 1 + 74 ( 7 )) 



i. e. a > 



2a(l+73(f))' 

1 + Q , ^47 ,, ,n.. 

G+rA2 »i(l+7(-))-a 



2((l-«)(3-a)) — 
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If G is a graph of maximum degree d, then G G LS{g, bw{G)) with g{i) = (d— 1)% 
because max{|V^^|, IV^^I} < bw{d— 1)*. Thus, theorem 2 implies 

6w(G) >^^(l-o(l)) = as ^-> 00 . (10) 



In the following, we derive bounds for all 3- and 4-regular graphs. One ex- 
ample are the d-regular Ramanujan graphs. For these graphs it holds A 2 > 
d — 2y/d — 1. Notice that the latter value is asymptotically the largest possible 
for d-regular graphs, since X 2 < d — 2^d— 1 -|- log^ ^pn7-T ?(T) 

12]). There are known construction of infinite families of d-regular Ramanujan 
graphs for any d of the form d = + 1, where p is any prime number, and k is 

an arbitrary positive integer (see [5,12,13,15]). 

Theorem 3. The bisection width of any 

1. 4-regular graph with A 2 < 2 zs at least min{^, 

2. 3-regular graph with A 2 < 2 zs at least min{^, • ^|^}. 

3. 3-regular graph with A 2 < is at least min{ | } . 



Proof. As mentioned above, for any d-regular graph G it holds G G LS{g, bw{G)) 
with g{i) = {d— 1)*. Solving the equation of lemma 1 for bw{G) we get 



bw{G) > max 

l=ai 



o?A2| 

2«1 + -f A2(d - - af)] 



( 11 ) 



with optimal values oi = 1, 02 = 
We only have to make sure that 1 = 
‘^bwY!r[g{i- 1 ). 



^d-i ^ rij+i — aj 

di < tt 2 < ■ ■ ■ < ai provided that n > 



4-regular, A 2 < 2 and bw < ^: We apply equation (11) with d = 4 and 1 = 2. 



5 -A 2 



and n > 2bw = ‘2bwJ2\^\ g{i — 1). We get 



We have 1 = oi < 02 = 
bw > ■ § in this case. 

3-regular, A 2 < 2 and bw < ^: We apply equation (11) with d = 3 and I = 2. 
We have 1 = Oi < 02 = and n > 2bw = 2bw ~ !)• We get 

bw > • f in this case. 

3-regular, A 2 < and bw < We apply equation (11) with d = 3 and 

I = 3. We have 1 = oi < 02 = < as = and n > 6bw = 

in this case. □ 



Y.\=\ - 1). We get bw > 2 



From A 2 < d—2yfd^^-\-0{\ogd-i(n)) ^ [16,1,12] follows that the conditions 
A 2 < 2 and A 2 < ~ 0.44 of cases 1 and 3 of theorem 3 hold true for 

sufficiently large graphs. The results of theorem 3 can not generally be improved 
by using more levels. 




New Spectral Lower Bounds on the Bisection Width of Graphs 



31 



The bounds of theorem 3 are at least as good as the bound of equation (1). 
For the first two cases, the new bounds are identical to the traditional bound 
for A 2 — >■ 2. For A 2 — >■ 0 the first case is by a factor of ^ and the second case is 
by a factor of 2 higher than the traditional bound. For A 2 — >■ 0 the third case is 
by a factor of | higher than equation (1) and for A 2 — >■ it is higher by a 

factor of « 1.52. 

Theorem 3 can be used to derive stronger lower bounds on the bisection 
width of Ramanujan graphs. 

Corollary 1. The bisection width of any sufficiently large 3-regular Ramanujan 
graph (such that A 2 < ) is at least 0.082n. The bisection width of any 

sufficiently large j-regular Ramanujan graph (such that X 2 < 2) is at least 0.176n. 

This improves upon the previously known lower bounds of 0.042n for 3-regular 
and of 0.133n for 4-regular Ramanujan graphs derived by equation (1). 

4 Worst Case Graphs 

In this section we present examples for which the results of the previous sec- 
tion are asymptotically tight. Firstly, we introduce a new class of edge-weighted 
graphs. They belong to the class LS{g,a) with g{i) = {i 1)“. We show that 
the results of theorem 2 are tight for this graph class if a > 1 and tight up to 
a constant factor if 0 < a < 1. Secondly, we analyze the double-root trees as a 
more realistic example. 

4.1 Worst Case Graphs with Edge Weights 

The Laplacian of a graph can be generalized for edge weighted graphs where 
the off-diagonal entries contain the negative values of the corresponding edge 
weight. 

Definition 2. Denote with Bf , a > 0 and I > 1, the edge-weighted graph ob- 
tained as follows. Bf consists of two symmetric sides. Each side consists of I 
levels of vertices with vertices in level i, 1 < i < 1. Edges connect every pair of 
vertices in consecutive levels. Each edge between vertices in levels i and i -\- 1 is 
weighted by Moreover, the single vertices on level 1 of each side are connected 
by an edge of weight 1. 

The graphs Bf belong to the class LS{g, a) with g{i) = {i -\- 1)“. We need the 
following lemma to prove the main theorems. 

Lemma 2. The eigenvalues of a matrix of the structure ^ ^ are the union 
of the eigenvalues of the matrices A-\- C and A — C 

Proof. Let x be an eigenvector of the matrix A-\-C. Then (x, x) is an eigenvector 
to the matrix ^ ^ for the same eigenvalue. If y is an eigenvector to the matrix 

4— C then {y, —y) is an eigenvector of the matrix ^ ^ to the same eigenvalue. 

□ 
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The following lemma can be shown using the separation theorem of [19]. 

Lemma 3. Let \ 2 {Bf) < 2, then the entries in the eigenvector of to the 
eigenvalue A 2 are equal for all vertices on the same level. Moreover x'f = —x\, 
with xf being the entry of the corresponding eigenvector on level i on side s G 
{ 0 , 1 }. 

Now we are going to calculate lower bounds for A 2 (i?“) and it follows from 
theorem 2, that there exists an Iq, such that for any I > IqH holds < 2. In 

the following we consider I > Iq. We denote with Xi the entry of the eigenvector 
of level i to the eigenvalue A 2 of . In section 3 we used as an approximation 
for Xi- Now we set Xi = ai + Vi, where ri = —\2Pi and Oi = 1 + 2 . Here 

Vi represents an error and in the following our method consists in showing that 
the rfs are small compared with a^. 

First, we consider the case a > 1. For this case we can show in general for 
a Graph G that the upper bound for A 2 is asymptotically tight if the results 
of Lemma 3 hold for G and + = 1 + 2 < 00. Let Qi be defined as 

qi+i = qt + a(i)(qi - qi-i + A), qi = 0, 52 = From the definition of qi, we 
obtains, = f{gHandft+i-<z* = ^+EpEi 



Lemma 4. For a graph G with A = 1 + 2 E^i J(J) < 00 it holds qi+i — qi > 
Pi+i — Pi > 0 for any 1 < i < I — 1. Moreover A 2 ® — >■ 00 



Proof. We first show the second statement of our lemma. 



3=1 






q = l p=l 



p=l q=p+l 






Conform theorem 2 it holds A 2 < 0{l~'^) and therefore the second statement of 
the lemma holds. For the first statement let Si = qi — pi. We use induction on 
i. For i = I the lemma holds. We assume that the lemma holds for any j < i. 
Then 



5i+i = <Ji+i - Pi+i = <Ji + a{i){qi - qi-i + A) - pi - a(i)((l - \ 2 )pi - Pi-i + fli) 

= + a{i){5i - 5i-i) + a{i){A - at) + a(i)(pi_i + \ 2 Pi) > Si 

Now we have to show, that Pi+i —PiPO. Our assumption was, that the lemma 
holds for any j, 0 < j < i. This implies that 0 < Pi < qi. Then pi+i — Pi > 
a{i){ai — A 2 < 7 i) > 0, where A 2 < 0{l~'^) because of theorem 2 and therefore the 
last inequality holds. □ 

We state a lower bound for A 2 (G) in the next theorem with A = 1+2 < 

00 . 

Theorem 4. If For a graph G with A = 1 + 2 J{j) < 00 if the results of 
lemma 3 hold, then we obtain 

- A ' n(l + o(l))' 
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Proof. Let us consider A 2 = fl[x) ’ "'^here x is the eigenvector to the eigenvalue 
A2. Now using lemma 3 it holds fi{x) = — Xi)^g{i) and 

f 2 {x) = 2Y!i=ixlg{i - 1). We get 

;-i i-i 

/i(a:) > 4 + 2 ^ - 8 A 2 ^(qi+i - qt) > 4a; - o{l) 

i=l i=l 



f2(x) < 2 + xlql)g{i) < 2 ^ aip(i) + o(Z^) < 2 ^ afg{t) + o{l^) 



Calculating the bounds for f\{x) and f^ix) we obtain the result of the theorem. 

□ 



This theorem provides also results for the graph class Bf which is a subclass of 
the graphs considered above. 

In the next theorem we consider the case 0 < a < 1. 



Theorem 5. 7/0 < a < 1 then there exists a constant c{a) such that \ 2 {Bf) > 

c(a) 1 with da) — ^(i-Q^)(3-q)(7-q) 

P{l+o{l)) — 4(7-a) + (l-a)2(3-a) ■ 



The technique which is used is the same as for theorem 4. 

We summarize the results of this section and compare them to the results of 

theorem 2. For the case a > 1, the upper and lower bounds on A 2 of theorems 2 

and 4 are asymptotically tight. In the case 0 < a < 1, the bounds of theorems 2 

and 5 are tight up to a constant factor. To see this, it is known that n = 

+ 0 ( 1 )) and thus there exits a d{a) such that A 2 > 

d(a) — 2 — . A special case is a = 0. The graph B? is a path of length 21 

„a + l (1 + 0 ( 1 )) 

and it holds c(0) = || from theorem 5. From theorem 2 we get A 2 < ■ 

2 

However, for the path we know that X 2 ^ fp- Thus, there is a gap in a constant 
factor between the lower and the upper bound for A 2 . 



4.2 The Double-Root Trees as Worst Case Graphs 

We define the double-root tree Td^i as follows. It has a central edge c, each vertex 
of the tree has either degree 1 or d, and the distance between c and any vertex 
of degree 1 is ? — 1. Let n = 2 ^'^'^^^^"^ denote the number of vertices of and 
we denote with L{Tdg) the Laplacian of Tdg. 

In this paper we make use of the proof of a lemma in [9] , where a lower and 
upper bound is given for the second smallest eigenvalue of the double-root tree. 
In [9] the authors show that the entries of the eigenvector to the second smallest 
eigenvalue oi L{Tdg) depends only on the levels Indeed for any I < i < Ht holds 
= —x]. Moreover Tdi belongs to the class L{g,l) with g{i) = [d — 1)* and 
< 00 and therefore we can use theorem 4 and obtain the 

following theorem 
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Theorem 6. The smallest non-zero eigenvalue A 2 of the Laplacian L(Td^i) sa- 
tisfies 

^ . d-2 

Theorem 6 shows that the bound of equation (10) is asymptotically tight. 
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Abstract. Two algorithms for threading directed Eulerian mazes are 
described. Each of these algorithms is performed by a traveling robot 
whose control is a finite-state automaton. Each of the algorithms puts 
one pebble in one of the exits of every vertex. These pebbles indicate an 
Eulerian cycle of the maze. The simple algorithm performs 0(|U| • |E|) 
edge traversals, while the advanced one traverses every edge three times. 
Both algorithms use memory of size O (log dout(u)) in every vertex v. 



1 Introduction 

We consider the problem of traversing a directed finite Eulerian maze, G(V,E), 
by a robot R whose movements are controlled by a finite automaton. Initially 
R contains no information about the maze, other than the fact that it is at a 
vertex of the maze, called the root. R’s duty is to traverse every edge of the 
maze, and in the process to leave signs which indicate an Eulerian cycle; i.e. a 
cycle in which every edge appears once, and only once. 

It is assumed that the vertices carry no names. R initially marks the root 
as r, but no other vertex will be given a name. Yet, in each vertex, the exits 
to the outgoing edges are cyclically numbered. If the outdegree of vertex v is 
dout(v), then the exits to the edges are marked 1,2, .. . ,dout{v)- The entrances 
(of incoming edges) are not marked, and R, when entering a vertex has no 
awareness of which entrance it has come through. We call this situation non- 
input-awarness in contrast to input-awareness in which R knows from which 
entrance it has come through. 

In 1967, Michael Rabin [7] considered the problem of threading undirected 
connected graphs by means of a finite automaton which has some fixed number of 
pebbles it can leave at vertices and which it can move from one vertex to another. 
He proved that no such automaton can thread all finite undirected graphs. Blum 
and Sakoda [2] extended the result. It follows that no finite automaton threads 
all finite, degree bounded, connected graphs. 

* Supported by the Fund for the Promotion of Research at the Technion. 
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In view of this impossibility result, it is no wonder that the classical (19-th 
century) algorithms for threading undirected mazes, such as that of Tremaux’s 
and Tarry’s (see, for example Even’s book [4], Chapter 3) store data in the 
vertices. More about threading undirected mazes can be found in the book of 
Hemmerling [6] . However, these algorithms are not easy to implement on directed 
graphs due to the fact that backtracking edges is not allowed. 

Notice that every connected undirected graph can be transformed to a direc- 
ted Eulerian graph by replacing every edge u — v with two directed edges u — > v 
and V — > u. Thus, the impossibility result for undirected graphs implies the im- 
possibility for directed Eulerian graphs. 

Therefore, in order to thread all finite directed Eulerian graphs, it is natural 
to allow our robot, i?, to store some information in the vertices. 

Even, Litman and Winkler [5] presented an algorithm of time complexity 
0(|Ep) to thread directed networks. However, their algorithm assumes a diffe- 
rent computational model — each vertex is a finite automaton and the directed 
edges are communication links. Afek and Gafni [1] present a fairly complicated 
solution which uses essentially Depth-First Search (like Tremaux’s algorithm), 
and includes a method to effectively backtrack on directed edges. Their method 
uses 0{\V\ ■ \E\ + \ V\^ ■ log |E|) edge traversals. We shall later say more about 
the comparison of our results with those of Afek and Gafni. 

Deng and Papadimitriou [3] use a more powerful model. They assume that 
vertices have distinct identities and that the robot is a full power computer 
(such as a Turing machine). They describe a recursive algorithm, for the case 
of directed Eulerian graphs, which traverses each edge exactly twice, and they 
build on this algorithm to attack the problem for general strongly connected 
directed graphs. 

We describe two algorithms. The first, simple algorithm, uses one pebble in 
each vertex v. The pebble’s task is to mark one of the exits. Alternatively, this 
is like having a memory of 0(log do„t(u)) at v. The algorithm uses also a one bit 
memory at each vertex to remember if all its outgoing edges have been traversed. 
This variable is initially assumed to be set to NEW, and later R will switch it to 
OLD. The pebble is moved by R from the exit it is about to use, to the next one 
in a round-robin fashion. This is the only mechanism used for deciding on the 
order in which edges are traversed. R carries a flag which, in conjunction with 
the NEW/OLD signals at the vertices helps R decide when to halt. The Simple 
algorithm may use 0{\V\ ■ |A|) edge traversals before it halts. 

Afek and Gafni argue that their scanning method is essentially optimal, since 
there are (nonEulerian) directed graphs which require f?(|y|-|if|) edge traversals. 
However, as we shall see in Sect. 2.3, there are (Eulerian) graphs for which any 
Depth-First Search approach, including that of Afek and Gafni, takes I2(|y| • |E |) 
edge traversals, while ours takes 0(|Ep). 

Next, we describe a generic algorithm which has two parts. Preparatory and 
Stroll. Preparatory does not adhere to the traveling robot model, and is non- 
deterministic. During Preparatory a pebble is placed in each vertex at one of 
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its exists. Stroll is performed by a traveling robot R. It uses the pebble, placed 
initially by Preparatory, exactly as in the Simple algorithm. When R has tra- 
versed all outgoing edges of the root, it halts. It is proved that i?’s route is an 
Eulerian cycle, and that when it is done the pebbles are back where Preparatory 
put them. 

Finally, we describe an advanced algorithm. It is deterministic and is perfor- 
med by a traveling robot. Its end result is in accord with Preparatory. It uses 
two more pebbles per vertex, as well as two variables that use 3 bits of memory 
in each vertex. Each edge is traversed exactly 3 times. 

The Recursive algorithm of Deng and Papadimitriou can also be performed 
by a traveling finite robot. In this case if we allow input awareness then the 
algorithm will have the advantage that it scans every edge only twice . However, 
the memory in each vertex v will be O{dout{v) ■ logdout{v)) . If the traveling 
robot has no input awareness, then each edge is scanned exactly 3 times and the 
memory in each vertex v is O{dout{v) - logdoutiv)) ■ This was shown by Tayar [8]. 

It is worth mentioning that all three algorithms (Simple, Advanced and Re- 
cursive) may yield different Eulerian cycles, as they do if applied to the maze 
shown in Fig. 3. 



2 The Simple Traversal Algorithm 



Let G{V, E) be a finite Eulerian directed graph. Namely, G’s underlying undi- 
rected graph is connected, and for every vertex v € V, dm{v) = dout(v); i.e. its 
indegree is equal to its outdegree. Euler’s Theorem implies that G has a directed 
cycle in which every edge of G appears exactly once. 

Since di„{v) = dout{v) we shall simply use d{v) to denote dout{v)- In each 
vertex v € V, the d{v) exits to its outgoing incident edges are numbered 1,2,... , 
d{v). 



2.1 The Algorithm 

The traversal of the graph is to be performed by a finite-state robot R. The algo- 
rithm which governs the behavior of R consists of two mechanisms: The scanning 
mechanism, which determines the order in which the edges are traversed, and 
the halting mechanism. 



The Scanning Mechanism In each vertex v there is one pebble, initially 
placed at exit No. 1. The robot, when in vertex v, leaves through the exit marked 
with the pebble, but before going out R moves the pebble to the next exit, in a 
round-robin fashion. 

Clearly, the scanning continues indefinitely, unless the halting mechanism 
stops it. 
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The Halting Mechanism For the purpose of deciding when R stops scanning, 
the vertices are marked as follows: 

— The vertex in which R is initially placed is called the root and is marked r. 
This label is never changed. 

— Every other vertex u r is initially marked NEW. (No need to have a vertex 
name.) This label is eventually changed to OLD and will never change again. 

In addition, the robot carries a flag F. The flag may be UP or DOWN, and its 
initial value is DOWN. Upon reaching a vertex v the marking of v and the flag 
are changed as shown in Table 1. 



Table 1. The halting mechanism. 



if V ^ r then do 

ifv is marked NEW then do 
F ^ DOWN 

if the pebble is at exit No. d{v) then 
set mark to be OLD 

if V = r and the pebble is at exit No. Ithen do 
(This event is called the end of the phase) 
ifF=\JP then HALT 
E ^ UP 



2.2 Validity 

Theorem 1. The robot R will halt (at the root r) after having traversed all 
edges of G. This will happened within 0(^|U| • \E\) edge traversals. In its last \E\ 
edge traversals R makes an Eulerian tour. 

Note that upon termination the pebbles are not necessarily back in their 
initial condition. In fact, their position is exactly what has been “learned”. If 
now the traversal is restarted, with F = UP, R will perform an Eulerian tour 
and stop, and the pebbles will again be in the “learned” position. 

Proof. Note the definition of a phase, as in Table 1. First, let us examine the 
route of R during the first phase. Clearly, every edge incident with r has been 
traversed exactly once. Furthermore, every edge on the route of the first phase is 
traversed once only. This is easily proved by induction on the order in which the 
edges are traversed: When a vertex u r is entered by an edge (traversed for the 
first time) the number of used incoming edges is one greater than the number 
of used outgoing edges. Thus, the exit in which the pebble is placed leads to an 
edge which has not been traversed yet. 

Note that if during the first phase every incident edge of u yf r is traversed 
then v’s label is OLD. Other vertices are still marked NEW. However, for every 
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vertex the number of untraversed incoming edges is equal to the number of 
untraversed outgoing edges. 

Now consider the second phase. As long as the route goes through vertices 
labeled OLD, the route of the second phase is identical to that of the first phase. 
This follows from the fact that at the end of the first phase, the pebble is back 
at exit No. 1. If not all edges of G have been traversed in the first phase, there 
must be vertices on the first route for which some of their outgoing edges have 
not been traversed and such vertices are still labeled NEW. This follows from 
the fact that G is strongly connected. If this is the case, let v be the first vertex 
labeled NEW which is encountered in the second phase. A new tour starts at v, 
using only edges untraversed in the first phase, and ending in v. Now v is marked 
OLD and its pebble is at exit No. 1. The route of the first phase is resumed. 
Since every vertex which is still marked NEW and which is on the route of the 
first phase is similarly treated, at the end of the second phase all vertices on 
the first route are now labeled OLD. Every traversed edge has been traversed 
exactly once in the second phase, which is the second time for those of the first 
route. 

In the following phases, the algorithm will maintain (as in the first two pha- 
ses) the following property: at the beginning of every i'th phase, in every OLD 
vertex , the pebble’s location is exactly where it was at beginning of the {i — l)'st 
phase. This ensures that R’s behavior in the i'th phase will be similar to that in 
the second phase. In other words, this property ensures that R, in the i'th phase, 
will retrace the route of the {i — l)'st phase and will expand it upon reaching 
every NEW vertex ,u ,that belongs to the route of the {i— l)'st phase. After the 
expansion in u, R will exit through exit No. 1, which is, again, where the pebble 
was in the beginning of the (i — l)'st phase. 

If no NEW vertex is encountered during a phase, R will return to r with its 
flag UP, and the process is halted. During each phase, other than the last and 
possibly the first, at least one NEW vertex becomes OLD. Thus, there are at 
most \V\ + 1 phases. Hence, the number of edge traversals is 0{\V\ ■ |E|). □ 

2.3 Lower Bounds on the Simple Algorithm 

Let us consider the scanning mechanism by itself and discuss its complexity of 
edge traversals, up to the point that every edge is scanned. 

First consider a simple chain, as shown in Fig. 1. Let us denote by /s(n) the 





Fig. 1. A simple chain 



number of edge traversals, starting in vertex 1 and ending with the last edge to 
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be traversed, namely, the edge from vertex n to vertex n — 1. Clearly, /s(2) = 2 
and fs{n + 1) = /s(n) + 2n — 1. Thus, 

/s(n) = - 2n + 2. (1) 

It follows that in this case, the bound of 0(|y | • |if |) is tight. However, the simple 
chain is sparse. To see that this may happen for dense graphs too, consider the 
following directed graph Gp{V, E), where p is an odd prime. 

C = {0,1 ,--- ,2p-l}. 

The set of directed edges consists of two types: 

body 

For every two vertices, i,j < p, let = j — i mod{p). If 6{i,j) < | then 

there is an edge i — >■ j, and the exit at i is labeled 

— tail 

— There is an edge 0 — >■ p labeled . 

— For every p <i <2p — 1 there is an edge i — i + 1, labeled 2. 

— For every p < i < 2p — 1 there is an edge i — > * ~ 1> labeled 1. 

— There is an edge p — ^ 0 labeled 1. 

The case of G5 is shown in Fig. 2. 




Note that the scanning mechanism, starting in vertex 0, scans all edges 

of the body; i.e. the edges between vertices whose name is less than p, before it 
takes the edge to p. In fact, every time it returns to vertex 0, via the edge from 
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p, it will repeat the scan of all edges of the body. This will occur p times. Thus 
the whole tour until the last edge (from 2p — 1 to 2p — 2) is traversed is of length 



p{p- 1) 

P ^ 



2p + 2. 



Again, this is of the order \V\ ■ \E\, and Gp is dense. 

Finally, we examine an important property of the Simple algorithm: if in a 
constant number of phases all vertices are visited, the edge traversals complexity 
is 0{\E\). There are many Eulerian graphs in which R will pass through all 
vertices in a constant number of phases. In such cases, this property makes the 
Simple algorithm at least as good as any DFS based algorithm. In particular, 
there are graphs in which the Simple algorithm is superior. 

An example in which the Simple algorithm has a better edge traversals com- 
plexity is its execution on the a directed graph which is composed of a direc- 
ted cycle with a self-loop in each vertex (assume that for every vertex u, exit 
No. 2 leads to u’s self-loop). The Simple algorithm’s edge traversal complexity 
is 0(|E|). In the first phase R visits all vertices and in the second phase an 
Eulerian tour is performed. However, if we execute any DFS based algorithm on 
the same graph, R will have to make l7(|Ep) edge traversals. 

Another example of its superiority is in the case of a complete directed graph 
of n vertices; namely for every two vertices a and b there is an edge a — > b. In 
the first phase all vertices are visited, since the root has an edge to every other 
vertex. Therefore, all edges not traversed in the first phase are traversed in the 
second, and the third phase is the last phase. Thus, the total number of edge 
traversals is 0{\V\^). However, if one applies a DFS based algorithm, it takes 
I7(|y P) steps. (The DFS tree is a directed path of length \V\ — 1, and each time 
a back-edge is traversed, the tree must be traversed again, at least up to the 
parent of the start vertex of the back-edge.). 



3 A Generic Traversal Algorithm 

In effort to construct a more efficient algorithm for finding an Eulerian directed 
cycle in G{V,E), in terms of the number of edge traversals, we describe first a 
generic algorithm. This is a nondeterministic algorithm which deviates from our 
computational model; i.e. it is not limited to actions performed by a traveling 
finite automaton. Later on we show how to remove the nondeterminism and run 
the algorithm by means of a finite automaton which threads the directed graph. 

3.1 Definitions 

— An edge is new if it has not been traversed yet. It is old otherwise. 

— A vertex is new if all its outgoing edges are new. It is old otherwise. 

— An exploration from vertex u is a directed path which starts in v and 
continues via a new directed edge, as long as there is one. It is assumed that 
in each vertex exits are chosen consecutively according to their exit numbers, 
beginning with the least exit number which leads to a new edge. 
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3.2 The Preparatory Algorithm 

Consider Preparatory, as stated in Table 2. Note that the first exploration ends 
at the vertex it has started. This observation of Euler, follows from the fact that 
when another vertex u is entered, the number of old incoming edges is greater 
(by 1) than the number of old outgoing edges, and since din{u) = dout{u), there 
is at least one new outgoing edge. When the first exploration ends, for every 
vertex the number of new incoming and outgoing edges is the same, and this 
invariant holds every time an exploration ends. 



Table 2. Preparatory 



choose a vertex r 

run an exploration from r 

while there are old vertices which have new exits do 
choose such a vertex u 
run an exploration from u 
for every vertex v do 

i/only one exploration passed through v 

then put v’s pebble at the first exit of this exploration 
else put the pebble at the first exit of the 

second exploration which passed through v 



For every vertex. Preparatory places a pebble at one of its exits. The position 
of these pebbles is the information learned by running Preparatory. 

3.3 The Stroll Algorithm 

After Preparatory has terminated, the tracing of an Euler cycle is to be per- 
formed by a finite automaton R. R does not need to recognize vertices, except 
the need to recognize the start vertex r. R uses the pebbles, initially placed 
by Preparatory. By using this information R can now trace an Euler cycle; see 
Table 3. 



Table 3. Stroll 



put R in r 

let e be the outgoing edge indicated by exit number 1 
(This is the exit where the pebble is located.) 

R moves the pebble to the next exit and traverses e to its endpoint 
while the vertex, R is at, is not r or the pebble is not at exit number 1 do 
let e be the outgoing edge indicated by the pebble 
R moves the pebble to the next exit and traverses e to its endpoint 
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Let us demonstrate the generic algorithm on the example shown in Fig. 3. 
For each vertex, the exits are numbered, and for convenience, all vertices, not 
just r are given names. Let us denote the Fth exit of vertex v by vi. The first 
exploration yields a cycle Ci : rl,al,cl. Assume the second exploration starts 
at a; it yields a cycle C 2 ■ a2,bl,c2. Assume the third cycle is C 3 : 62, c3. Now 
the pebbles are placed at exits rl,a2,62 and c2. This completes the operation 
of Preparatory. 

Now, during Stroll, the following Eulerian cycle is traversed: rl, o2, 62, c2, al, 
c3, 61 and cl. Note that the edges of a cycle may not be strolled consecutively, 
and they may even appear in an order different from the order in which they 
have been traversed in the exploration. 

Lemma 1. In the directed path indicated by Stroll, no edge is repeated. 

Proof. Omitted from the extended abstract. □ 



Lemma 2. Let u he a vertex and Ci, i > 1, is the cycle of the first exploration 
to pass through u. If R has traversed all edges of Ci which exit from u, then R 
has traversed all outgoing edges of u. 

Proof. Omitted from the extended abstract. □ 



Lemma 3. R traverses all edges ofC. 

Proof. Omitted from the extended abstract. □ 



Theorem 2. The robot R, moving according to algorithm Stroll, traverses an 
Eulerian cycle. 
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Proof. By Lemma 1 no edge is traversed more than once, and by Lemma 3, every 
edge is traversed. □ 

Note that after Stroll halts, the pebbles are back where Preparatory has put 
them. 

4 An Advanced Algorithm 

In this section we describe an algorithm in which the operation of Preparatory 
is performed by a traveling robot R, whose control is a finite automaton. The 
amount of memory required in each vertex v is 0(logd{v)), and each edge is 
traversed 3 times. 

Each vertex has 3 pebbles, named explore-pebble, retrace-pebble and tour- 
pebble. Initially, all three pebbles are at exit No. 1. 

There are two variables stored in each vertex v. 

visited indicates the number of cycles (explorations) which have passed 
through v; its value is 0, 1, or 2. Initially visited = 0. If its value is 2 then 
there are at least two cycles which have passed through v. 

flag indicates if v has previously been passed in the current cycle. Initially, 
flag = 0, and when R reaches v for the first time during the current exploration, 
it changes the variable flag to 1. Once all edges of an exploration are found, R 
retraces the cycle and assigns all flags on the cycle to be 0 again. Thus, the flag 
of a vertex v is changed to 1, and back to 0, as many times as there are cycles 
which pass through it. 

The vertex in which R is initially placed is the root r. 

The main procedure, which governs the actions of R, is create-eulerian- 
cycle; see Table 6. It employs two subroutines, explore (Table 4) and retrace 
(Table 5) . Also, we use two macros: 

— exit(X-pebble): R exits the vertex it is at via the exit in which the X- 

pebble is located now, but first it moves the X-pebble to the next exit, in a 

round-robin fashion. 

— number(X-pebble): the macro returns the number of the exit in which the 

X-pebble is located. 

Subroutine explore is activated in order to perform an exploration. 

If flag = 0 then the current vertex is visited during this exploration for the 
first time. If in addition, visited = I then the running exploration is not the first 
to pass through the current vertex, and the tour-pebble is put where explore- 
pebble is at, which is the first exit of the second exploration to pass through 
the vertex. If visited < 2 then it is incremented. Now flag is set to I. i? leaves 
the current vertex through the exit indicated by the explore-pebble, not before 
moving it to the next exit, in a round robin fashion. 

If flag = I then the number of the exit where the explore-pebble is at is 
checked. If it is exit No. 1, the subroutine is halted; R is back at the vertex 
where the exploration has started, and all outgoing edges are old. If it is not exit 
No. 1, then there are new outgoing edges and the exploration continues. 
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Table 4. Subroutine explore 



while TRUE 

if flag = 0 then do 

if visited — 1 then 

put tour-pebble at number(explore-pebble) 
if visited < 2 then 

visited visited + 1 
flag ^ 1 

exit (explore-pebble) 

else if number(explore-pebble) = 1 then HALT 
exit (explore-pebble) 



Table 5. Subroutine retrace 



while TRUE 

if flag = 0 then do 

if number(retrace-pebble) = 1 then HALT 
else exit (retrace-pebble) 
else do flag 0 

exit (retrace-pebble) 



Subroutine retrace is activated after the exploration of a cycle is done. Its 
purpose is to reset all flags of the vertices of the newly found cycle to 0. This is 
done as follows. 

If flag = 0 at the current vertex, it is checked if the retracing is done. This is 
indicated by checking the position of the retrace-pebble. If it is at exit No. 1, R 
is back at the vertex in which the exploration has started, and thus the retrace 
subroutine is halted. Otherwise, the retracing continues. 

If flag = 1, the flag is reset to 0, and the retracing continues. 



Table 6. Main procedure create-eulerian-cycle 



call explore 
call retrace 
exit (tour-pebble) 

while R is uot at r or number(tour-pebble) 1 do 
if number(explore-pebble) 1 then do 
call explore 
call retrace 
exit (tour-pebble) 



Note that create-eulerian-cycle is deterministic. R starts an exploration 
from r, first performing explore and then retrace. Now it is back at r, while 
all three pebbles are at exit No. 1. R behaves now as Stroll, advancing the 
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tour-pebble only, but this behavior is stopped when R hits a vertex v which 
has new outgoing edges; this is detected by the fact that the exit where the 
explore-pebble is at is not exit No. 1. When this happens, R suspends the Stroll 
behavior. It performs an exploration from v, again by first running explore and 
then retrace. While doing that, the tour-pebble of every vertex encountered on 
this cycle is moved from exit No. 1 to the first exit of the new exploration if the 
present exploration is the second to pass through that vertex. When retrace 
terminates, R resumes the Stroll behavior. 

It is easy to see that create-eulerian-cycle causes R to place the tour- 
pebbles in accord with Preparatory, and when R uses the exits where the tour- 
pebbles are at, it traverses every edge once. When create-eulerian-cycle ter- 
minates, the tour-pebbles are back where Preparatory could have placed them. 
Thus, at that time, one can use Stroll, using the tour-pebbles, to traverse an 
Eulerian cycle. 
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Abstract. We study the space and the access complexity of computa- 
tions represented by Computational Directed Acyclic Graphs (CDAGs) 
in hierarchical memory systems. First, we present a unifying framework 
for proving lower bounds on the space complexity, which captures most 
of the bounds known in the literature for relevant CDAGs, previously 
proved through ad-hoc arguments. Then, we expose a close relationship 
between the notions of space and access complexity, where the latter 
represents the minimum number of accesses performed by any computa- 
tion of a CDAG at a given level of the memory hierarchy. Specihcally, 
we present two general techniques to derive bounds on the access com- 
plexity of a CDAG based on the space complexity of certain subgraphs. 
One technique, simpler to apply, provides only lower bounds, while the 
other provides (almost) matching lower and upper bounds and improves 
upon previous well-known result by Hong and Rung. 



1 Introduction 

The substantial fraction of the cost of a computing system accounted for by 
memory has motivated the study of space-efficient computations since the early 
times of automatic computing. However, a more fundamental reason to strive 
for space efficiency arises directly from the pursuit of performance: due to basic 
physical principles, access time is bound to increase with the size of memory. 
Ultimately, smaller memory translates into faster computation, making space 
efficiency a crucial objective, even when the cost of memory is negligible. 

When a large overall memory is required, it becomes convenient to organize 
it hierarchically, into a sequence of levels whose size and access time increase 
progressively. Currently, the levels of the memory hierarchy include the register 
file, two or three caches, main memory, and disks. Compared to registers, main 
memory is a few hundred times slower and disks are a few million times slower. 
Hence, effective use of the faster levels of the memory hierarchy is becoming a 
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key concern in the design of algorithms. As in all endevours where performance 
is systematically pursued, it is important to be able to evaluate the distance from 
optimality of a proposed solution. The present paper focuses on the development 
of general techniques that can be employed to prove bounds on certain relevant 
measures related to the usage of the memory hierarchy by a computation. 

We consider the problem of finding an optimal implementation of a compu- 
tation which has been specified in terms of a Computational Directed Acyclic 
Graph (CDAG), whose vertices represent operations (of both input and proces- 
sing type) and whose arcs represent data dependencies. The degrees of freedom of 
the implementor are essentially the schedule of execution of operations, possibly 
including repetitions, and the memory management, that is, the assignment of a 
memory location to each value arising in the computation during the time bet- 
ween production and last use of such value. An extensive literature on the space 
complexity of CDAGs, typically formulated in terms of so-called pebbling games, 
has developed in the past (see e.g., [9,5,7,10,8,11,12]). A pioneering paper by 
Hong and Kung [6] begun exploring complexity in two-level memory hierarchies 
and proposed a technique to bound from below the number of accesses performed 
at the slowest level of the hierarchy for a given GDAG. To date, this paper re- 
mains the main point of departure of most lower bound analysis for hierarchical 
memory performance (see, e.g., [1,2]). Savage [13] developed an interesting refor- 
mulation of the Hong and Kung’s technique as well as an extension to multilevel 
hierarchies. (A comprehensive survey of all of the above results can be found in 
[14, Gh.10,11].) In a recent work, Bilardi and Preparata [3] proposed a new lower 
bound technique based on the notion of dicothomy-width, which is restricted to 
implementations where every operation is executed exactly once. Such technique 
has been successfully extended to distributed memory hierarchies. 

In this paper, we study both the space and the access complexity of GDAGs, 
and, in particular, expose a tight relationship between the two. Models and 
relevant quantities are defined in Section 2. In Section 3, we introduce a family 
of graph-theoretic measures that can be employed to prove lower bounds to the 
space complexity of GDAGs. Each measure is associated with a suitable class 
of permutations of the vertices, and it is defined in terms of a suitable notion 
of boundary size for the prefixes of such permutations. More specifically, the 
measure is the minimum, over all the permutations in the class, of the maximum 
boundary size of a prefix. The key link between the space complexity of a GDAG 
and each measure is established by showing that each execution of the GDAG 
must embed a permutation of the operations belonging to the associated class 
and that, at each stage of the execution, the values of nodes in the current prefix 
must be stored in memory. We show how most of the results in the literature 
on space complexity, which are derived by ad-hoc arguments for the GDAG at 
hand, can be systematically derived in the outlined framework. 

In Section 4, we turn our attention to the access complexity Qg{s) of a GDAG 
G defined as minimum number of accesses to locations with address x > s, 
over all executions of G, on a memory with locations addressed by the natural 
numbers starting from 0. First, we discuss a simple but effective technique that 
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affords the derivation of lower bounds on Qg{s) from the space complexity of 
disjoint subdags of G. Second, we show, in a sense that can be made precise, 
that Qg{s) is closely related, by both lower and upper bounds, to the minimum 
number of subdags of space complexity at most 2s that “cover” the given CDAG. 
On the side of lower bounds, our results encompass those of both [6] and [14]; on 
the side of upper bounds, we are not aware of any previous comparable results. 
In spite of their greater generality, our arguments are somewhat simpler than 
the previous ones. Among other things, they show that the concept of minimum 
set featuring in the analysis of Hong and Kung is not essential. In Section 5, we 
briefly indicate some directions for further research. 

2 Preliminaries 

Let G = {V,E) he & CDAG where the nodes represent values produced by 
unit-time operations, and the arcs represent data dependencies. For every arc 
(m, v) € E we say that u is a predecessor of v and w is a successor of u. The set of 
predecessors of a node v are the operands of the operation that produces v. The 
in-degree (resp., out-degree) of a node is the number of its predecessors (resp., 
successors). Nodes of in-degree (resp., out-degree) 0 are regarded as inputs (resp., 
outputs) of the CDAG. A computation of G specifies a particular scheduling of 
the operations associated with its nodes, which satisfies data dependences, and 
a particular memory management. 

We study CDAG computations on the RAM model with a memory of un- 
bounded size whose cells are addressed by the natural numbers starting from 0. 
A standard computation of a CDAG G starts with the values of all input nodes 
in memory and must determine the values of the output nodes by performing a 
sequence of node evaluations, where a node evaluation performs the operation 
associated with a node v, provided that the operands of v are in memory, and 
stores the result in memory. 

In general, a node can be evaluated more than once. The space needed by 
a computation is defined as the maximum number of values concurrently resi- 
ding in memory at any one time during the computation. We define the space 
complexity of G, denoted by S{G), as the minimum space over all standard 
computations of G. 

In the paper we will also refer to a slightly different class of CDAG compu- 
tations, called free-input computations, that adopt the following rule: the com- 
putation of a CDAG G starts with an initially empty memory and produces the 
value of each input node, every time it is needed, by invoking a special load 
instruction. It is easy to argue that the space complexity of the free-input com- 
putations of G, which we denote by S'free(G), is not larger than S{G), hence any 
lower bound to Sfree{G) applies to S{G) as well. We observe that 5'free(G) is also 
the measure of space captured by the Red Pebble Game model [7,14]. 

Suppose that the memory is partitioned into two levels, namely LI consisting 
of the s cells of address x G [0, s), and L2 consisting of the cells of address x > s, 
featuring respectively low and high access costs. For a CDAG G we define the 
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access complexity Qg{s) as the minimum number of accesses to L2 performed 
by any computation of G. When studying the access complexity we will refer 
only to standard computations. 



3 Space Complexity 

If we rule out multiple executions of the same operation, the evaluation schedules 
of a CD AG are in one-to-one correspondence with the topological orderings of 
its nodes. Furthermore, for a given topological ordering, say (p = 4>i4>2 ■ ■ ■ Pn, 
one can easily argue that, immediately after evaluation of node (pi, every v G 
{pi , . . . , pi} with at least one successor in {pi+i , . . . , pn} must be in memory. 
Repetition of operations greatly complicates the analysis, because the simple and 
convenient correspondence between topological orderings and valid schedules no 
longer holds, and it is difficult to identify what must be in memory at any given 
point of a schedule. 

The contribution of this section is a framework that associates arbitrary sche- 
dules (where repetitions are allowed) with suitable permutations of the vertices, 
while maintaining a grip on space requirements. Informally the approach works 
as follows. Consider an arbitrary n-node CD AG G. We introduce the notion of 
marking rule / as a criterion to associate with each node u of G a family /(u) 
of subsets of its successors. Given a computation of G, the marking rule can 
be employed to “mark” exactly one evaluation for each node of G. The marked 
evaluations, in order of execution, define a permutation p = p\p 2 ■ ■ ■ pn of the 
nodes of G with the following property. For 1 < i < n, there is a time during 
the computation such that every pj with j < i and with at least one of the 
subsets belonging to f{pj) included in {pi+ii ■ ■ ■ , Pn} must be in memory. (Such 
pj's are said to form the boundary of the prefix.) We should observe that, when 
applied to a given CDAG G, a marking rule provides a lower bound to Sfree{G). 
In general, different rules generate different classes of permutations as well as 
different types of boundaries, often with opposite impact on the lower bound 
measure. The approach outlined above is formalized next and then applied to 
some well-known CD AGs. 



3.1 A General Lower Bound 

Let G = (V,E) be an n-node CDAG. Let also O CV denote the set of output 
nodes, and N{v) the set of successors of v, for every v G V. A marking rule for 
G is a function / which maps every v GV to a family of subsets of N{y) such 
that it V G O then f{v) contains only the empty set, while if v G V\0 then f{v) 
cannot contain the empty set and must contain at least one (nonempty) subset of 
N(v). In the paper we will often refer to a particular marking rule, the singleton 
marking rule where for every v G V\0 /q*"®^(u) = {{u} : u G N{v)}. 

A permutation p = pip 2 ■ ■ ■ pn of the nodes of G is an /-marking of G i/f 
for every 1 < z < n there exists q G f{pi) such that q C {pj : i < j < n}. 
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Moreover, we define the i-boundary of any f -marking 4> as the set 

Bl{i) = {ve{(j)i,...,(l)t}\0 : 3qe f{v) s.t. qC{(j)^+l,...,(j)n}} ■ 

In other words, B^{i) represents the set of nodes v G I^\0 such that vcpi+i ■ ■ ■ 4>n 
is the suffix of a legal /-marking of G. 

The following theorem shows a relation between the space complexity of the 
free-input computations of G and the size of the boundaries of its /-markings. 
Let Fg denote the set of marking rules for G and, for / G Fq, let <?(/) denote 
the set of /-markings of G. 

Theorem 1. The space complexity of the free-input computations ofG satisfies: 

Sbee{G) > max rnin rn^ \B^Ji)\- 
f&Fc i<*<" ^ 

Proof. Consider an arbitrary function / G Fq and a T-step free-input computa- 
tion G of G. Let vt be the node evaluated at step t of C, for 1 < t < T. We can 
construct an /-marking /> = ^i (/2 • • • of G by sweeping G backwards through 
the following loop. 

j = n; 

for t = T down-to 1 do 

if (vt^ {4>j +!,■■■, <fn}) and (3q€f{vt) : g C {(/j+i, . . . , 
then {set cj)j=vt\ 

It is easy to verify that at the end of the loop the resulting sequence </> forms 
indeed an /-marking of G. For 1 < j < n let r(j) denote the index such that 
the node (j>j is set at iteration r(j) (hence 4>j = Vr(j)). For an arbitrary index i 
we now show that the values of the nodes in B^{i) must be in memory at the 
end of step r(z) of G. Consider, in fact, a node (fj G B^{i). The definition of 

B^{i) implies that j < i and that (</i+i, . . . , includes a subset q G f{4>j). 
Let fc > z -I- 1 be the smallest index such that (pk G It is easy to see that 
T{j) < t(z) < r{k) and that for every r(j) < t < r{k) Vt yf pj. Hence the value 
of node <j)j, which is used to evaluate pk, must be in memory at step t(z). Since 
i was chosen arbitrarily, we may conclude that the space required by G is not 
less than maxi<i<„ |H^(z)|. The theorem follows by minimizing over all possible 
(f> G <P{f) and by maximizing over all possible / G Fg- □ 



3.2 Application to Known CDAGs 

In this subsection we illustrate how the marking approach constitutes a uniform 
and effective framework for proving lower bounds on the space complexity of 
CDAGs. Specifically, we will show how most of the lower bounds known for 
relevant CDAGs can be obtained by applying Theorem 1. 
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Complete binary tree. Let G = (V, E) be a complete binary tree with n = 2m— 1 
nodes, m leaves and arcs directed from each internal node to its parent. Thus, 
there are m inputs (the leaves) and one output (the root). The following argu- 
ment is a reformulation of the one given in [9], within the marking framework. 
Let <l> = (j>i4>2 • • • <()n be an arbitrary *"®^-marking of G and let 4>i4>2 ‘ ‘ ' 4>j be 
the smallest prefix that includes all leaves. Hence (j)j must be a leaf. It is easy 
to see that all nodes in the unique path tt from father to the root must 
be included in the suffix <pj+i ’ ’ ' 4’n- There are logm -|- 1 paths from distinct 
leaves to the root which intersect only at tt. Since all leaves are in the prefix 
and all nodes of tt are in the suffix, every such path includes a distinct node in 

j(sing) 

(j). By Theorem 1, we get 

-S'free(G) > logm-k 1, 
which is tight, as shown in [9]. 

Pyramid. Let G = (V,E) be a pyramid CDAG with n = m{m + l)/2 nodes 
obtained by taking the half of an m x m mesh above and including the nodes 
on the main diagonal, and by directing all edges towards the upper-right corner 
of the mesh. The graph has m inputs (the nodes on the mesh diagonal) and one 
output (the upper-right corner of the mesh). Let ^ be an arbitrary 

/q “®)-marking of G. Let also 4’i4>2‘ ■ ■ 4>j be the smallest prefix that includes all 
inputs. Hence (f>j must be an input. By reasoning as for the binary tree (thus 
reformulating the argument in [5] within the marking framework) we can show 

Asing) 

that (j)| > m. By Theorem 1, this implies 

5'free(G) ^ 777-, 



which is tight [14]. 



Composed CDACs. Let G = (V,E) be a CDAG and G' = {V',E') a subdag of 
G, where \V\ = n and \V'\ = m < n. Clearly, any free-input computation of G 
includes (at least) one free-input computation of G', hence S'free(G) > S'free(G'). 

This fact is reflected in the lower bound stated in Theorem 1, as explained 
below. It is easy to see that for every marking rule fc for G' there exists a 
marking rule fa for G which is an extension of fc in the sense that faW) = 
fc'iv) for every node v £ V which is not an output for G' . Moreover, the 
/c'-markings of G' are in one-to-one correspondence with the projections of the 
/^-markings of G on the nodes of V . Therefore, we have that 



max min max \B^P {i)\ > max 
/ce^G 0 G<?(/g) !<*<» /g'S-Pg' 



min 



max |H^=^'(z)|. 
Ki<m ^ 



( 1 ) 



Let G be the FFT CDAG with m inputs, m outputs, TO(logm + 1) nodes 
and arcs directed from the inputs towards the outputs (a formal definition can 
be found in [14]). Such a CDAG contains several copies of the m-leaf complete 
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binary tree G' as subgraph. Relation 1 implies that the marking framework 
provides a lower bound S'free(G) > log m+ 1 which, as argued in [14], is tight. As 
another example consider a diamond CD AG G with one input, one output, and 
n = m? nodes, obtained by gluing together two (m(m+ l)/2)-node pyramids at 
the inputs (corresponding inputs are coalesced into one node) and by reversing 
the direction of the arcs in the top pyramid. Again, Relation 1 implies that the 
marking framework provides a lower bound S'free(G) > m, which is easily seen 
to be tight. 

Finally, we want to show that marking rules can be defined in such a way to 
bring forward the presence of space demanding subgraphs within a CD AG, and 
that, in fact, in some cases this is necessary in order to obtain good lower bounds. 
Let G be the n = 2m-node CD AG formed by an m-node diamond CD AG D and 
an m-node linear array L, connected as follows (see Figure 1). Let wi, V 2 , ■ ■ ■ , Vm 
be a topological ordering of the diamond nodes and let mi , M 2 , • ■ • , Um be the 
(unique) topological ordering of the array nodes. In addition to the arcs of the 
diamond and those of the linear array, E contains the arcs {vi,Um-i+i), for 
1 < z < m. Therefore, G has one input, namely the diamond input Vi, and 
one output, namely the linear array output Um- R is easy to see that the space 
complexity of the free-input computations of G is dominated by that of the 
diamond component. By selecting f G Fq as a, suitable extension of one 

can prove 5'nee(G) > ^/rn, which is tight. However, the following claim shows 
that the singleton marking rule /g relative to the entire CD AG G yields only 

a trivial bound on S^eeiG). 

filing) 

Claim, min maxi<j<„ {i)\<2. 

Proof. Let 4> = 4>\4>2 ■ ■ ■ 4>2m be such that for 1 < z < m 



4>2i-l — Vm-i+1, 
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4 ^ 2 % — 



(sin ) filing) 

It is easy to see that (p is an -marking. Moreover, (1) 

for 1 < i < TO, 



{vm} and. 



(2i) = {ui}, 

rising) 

(2i + l) = {vjn-i,Ui}. 



Thus, the lower bound provided by is at most 2. 



□ 



Unlike the marking rule / mentioned above, which coincides with on the 

diamond, is unable to bring forward the complexity of the diamond due 

to the “disturbance” introduced by the arcs between D and L. 



4 Access Complexity 

The notion of access complexity (closely related to the I/O complexity studied in 
[6]) was introduced in [3] to capture the cost of CD AG computations in memory 
hierarchies by counting the number of accesses directed above a certain level. In 
this section we develop bounds on the access complexity of a CDAG in terms of 
the space complexity of certain subgraphs. A first method is based on a partition 
of the given CDAG G chosen a priori and relates the access complexity of G to the 
space complexity of the subgraphs of such partition. The method is conceptually 
simple to apply, since only the chosen subgraphs need to be analyzed, and, as an 
example shows, it may be employed successfully to get tight bounds. Building 
upon [6,13], the remaining two subsections develop a more sophisticated analysis 
which, while possibly more involved when studying individual CDAGs, does 
provide almost matching lower and upper bounds on the access complexity of 
a CDAG G in terms of the minimum cardinality of certain coverings of G into 
(non necessarily disjoint) subgraphs with limited space complexity. 

4.1 The Prepartition Approach 

We begin with a simple observation. A standard computation C of a CDAG 
G can be transformed into a free-input computation G' which starts from an 
initially empty memory and uses the same schedule of operations as G except 
that every node evaluation that requires an input as operand is preceded by a 
load of that input. It is easy to see that at least S'free(G') distinct memory cells 
are accessed during C" and, therefore, during G. Since max{0, S'free(G') — s} of 
these cells must belong to L2 we have: 

Qg(s) > max{0, S'free(G') - s}. 

The following theorem generalizes this argument. 
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Theorem 2. Let G be h node-disjoint subgraphs Gi, G 2 , • • ■ , Gh with space com- 
plexities (for free input computations) SfreeiGi), 1 <i < h. We have: 

h 

Qg{s) > y^max{0, Sf,ee{Gj) - s}. 

i=l 

Proof. Consider a standard computation G of G and let Ci be the subcomputa- 
tion of G relative to subgraph Gi, 1 < i < h. By reasoning as before we can easily 
conclude that at least S'free(G'i) distinct memory cells are accessed to read/write 
values of nodes of Gi. This accounts for at least max{0, S'free(Gi) — s} accesses 
to L2 relative to nodes of Gi, and the theorem follows. □ 

As an example, consider a diamond CDAG G with n = nodes. Note that 
G can be partitioned into 0 (n/ (4s^)) node-disjoint diamond subgraphs with 4s^ 
nodes and space complexity (relative to free-input computations) 2s each, thus 
yielding Qg{s) = Q{n/s). Moreover, one can easily orchestrate a computation 
of G which indeed performs O (n/s) access at addresses x > s. 

4.2 The Hong and Kung’s Lower Bound Method 

Below, we review the methodology proposed by [6] for proving lower bounds 
on the access complexity of CDAGs, based on the notion of s-partition. Let 
G = {V, E) be an n-node GDAG. A partition of V into subsets {Vi, V 2 , . . . , V),} 
is called a topological partition if, for each (v, v') G E, v G Vi and v' G Vj implies 

i < j- 

Definition 1. An s-partition of G is a topological partition {Vi, V 2 , . . . , 14} of 
subsets of V such that: 

1. For every 1 < i < h there is a set Di Q V of size at most s intersected by 
every path from an input to a node in Vi . Di is said to be a dominator set 
for Vi. 

2. For every 1 < i < h the set Mi C Vi of nodes of Vi with no successors in Vi 
has size at most s. Mi is called the minimum set of Vi. 

The following theorem is proved in [6, Theorem 3.1]. 

Theorem 3. Let ha{2s) be the minimum cardinality of a 2s-partition for G. 
Then, the access complexity of G satisfies 

Qg{s) > s{hG{‘2s) — 1). 

We observe that the above result, although successfully employed in [6] to prove 
strong lower bounds for several GDAGs, is rather weak if applied to GDAGs 
with few inputs and outputs. Specifically, if a GDAG G has at most 2s inputs 
and at most 2s outputs, as is the case, for example, of the diamond GDAG, the 
entire node set V has a dominator set (the set of inputs) and a minimum set (the 
set of outputs) of size at most 2s, hence it forms an 2s-partition of cardinality 
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hc{2s) = 1. In this case the s(/ig(2s) — 1) lower bound on the access complexity 
becomes meaningless.^ 

Savage [14] presents a version of the Hong and Kung result, indicating that 
his version is weaker but simpler to state and prove. Strictly speaking, Savage’s 
version is not comparable with Hong and Kung’s (i.e., each version might lead 
to stronger bound than the other one on suitable CD AGs). In the following 
subsection we further develop the ideas of both [6] and [14], achieving a result 
strictly stronger than both, and show that in an interesting sense the resulting 
bound is tight. 

4.3 The Covering Method: Lower and Upper Bounds 

Let G = (V,E) be an n-node CDAG. We observe that, when partitioning V 
into subsets, the crucial measure for a subset U is the amount of space needed 
to compute the nodes in Vi starting from a suitably chosen subset Di of their 
ancestors (which is easily seen to be a dominator set of Vi). Generally, in the 
process, a larger set Ui of nodes is evaluated. 

Definition 2. A topological partition {Vi, V 2 , . . . , 14} of V is s-coverable if for 
1 < i < k, there exists a set Ui, with Vi Q Ui Q such that SfUf) < s, 

considering as inputs of Ui all nodes of in-degree 0 or with a predecessor outside 
Ui- (Note that the inputs ofUi are a subset of the inputs ofG.) 

Theorem 4. Let kc{‘^s) be the minimum cardinality of a 2s-coverable partition 
for G. Then the access complexity of G satisfies 

Qg{s) > s(/cg(2s) - 1) 

Proof. Consider a T-step computation of G. For every v G V we mark the step of 
its first evaluation (inputs are all marked at the beginning of the computation) . 
Clearly, each node of V is associated with exactly one marked step. Partition the 
interval [1,T] relative to the computation of G into K subintervals such that in 
the first K — 1 subintervals exactly s accesses to L2 are performed, and in the 
K-th subinterval at most s accesses to L2 are performed. For 1 < i < AT we let 
Vi consist of the nodes associated with the steps marked in the t-th subinterval 
li. It is easy to see that {14,14, ■ • ■ , Vk} is a topological partition. To show that 
it is a 2s-coverable, we let Ui be the union of two (possibly nondisjoint) sets of 
nodes: the set of nodes whose values, at the beginning of 4, reside in LI or in 
the (at most) s cells of L2 accessed during Ip, and the set of nodes evaluated 
during li. It is easy to see that the inputs of Ui are included in the former set 
and that Vi U Ui U Moreover, it is easy to argue that there exists a 

standard computation of Ui that uses space at most 2s. The theorem follows 
since the evaluation of G performs at least s{K — 1) accesses to L2 and K is 
bounded from below by ko(2s). □ 

^ In fact, an Q (n/s) lower bound on the access complexity of a variant of the n-node 
diamond is proved in [6], assuming that all 2,/n — 1 nodes on the upper sides of the 
diamond are inputs. 
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The following theorem provides an upper bound on the access complexity of 
general CD AGs. 

Theorem 5. Let G have a 2s-coverable partition of cardinality k. Then, 

Qg(2s) < 4sfc. 

Proof. Let {Vi, V 2 , . . . , Vk} be a 2s-coverable partition of G. Let LI be the set 
of memory locations of address smaller than 2s and L2 the set of the remaining 
locations. We can compute G in k stages, where in Stage i the following activities 
are performed, 1 <i <k: 

1. Download the inputs of Ui from L2 to LI (except for Stage 1); 

2. Compute Ui in LI, uploading to L2 the value of every node v belonging to 
some Uj, with j > i, that was not uploaded in previous stages. 

The correctness of the computation follows immediately from the definition of 
2s-coverable partition. Moreover, it is clear that each input of Ui contributes at 
most two accesses to L2 (one associated with its upload and one at the beginning 
of Stage i), while nodes that are not inputs of some Ui never pass through L2. 
Let Di C Ui be the set of inputs of Ui and note that by definition of 2s-partition 
|I?i| < 2s. We have that the access complexity of the computation is at most 

□ 

As a straighforward corollary we have that Qg{2s) < Aska{2s). Moreover, 
from Theorem 4 we have that Qa{s) > s (fee (2s) — 1)- As observed in several 
studies (e.g., [1]), relations on Qq of this type translate into tight upper and 
lower bounds on running times for memory hierarchies where access time varies 
smoothly with the address. 

5 Conclusions 

In this paper, we have studied some issues related to space and access comple- 
xity. We have developed a lower bound technique for space of wide applicability. 
One interesting open issue remains whether our technique could reproduce the 
l7(n/logn) lower bound obtained in [11] for a particular CDAG recursively con- 
structed out of superconcentrators. Attempts to resolve this issue might shed fur- 
ther light on the relation of space complexity to other graph-theoretic properties 
of CDAGs. We have also explored various relations between access complexity 
and space complexity. Here the attention was focused on minimizing Qg{s) for 
a fixed value of s. When studying computations in hierarchical memory models, 
such as the HMM of [1] or the HRAM of [3], the objective is the minimization 
of running time, which can be viewed as a functional of the function Qg{')- A 
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number of interesting questions arise in this connection, which can benefit of our 
results on the access complexity, but also from a deeper investigation of the pos- 
sible shapes for the function Q (e.g., see [4] for issues related to the portability 
of performance across different hierarchical machines). 
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Abstract. Using the specific structure of the minimal separators of AT- 
free graphs, we give a polynomial time algorithm that computes a trian- 
gulation whose width is no more than twice the treewidth of the input 
graph. 



1 Introduction 

The treewidth of graphs, introduced by Robertson and Seymour [11], has been 
intensively studied in the last years, mainly because many NP-hard problems 
become solvable in polynomial and even in linear time when restricted to graphs 
with small treewidth. These algorithms use a tree-decomposition of small width 
of the graph. A tree-decomposition or a triangulation of a graph is a chordal 
supergraph, i.e. all the cycles of the supergraph of length strictly more than 
three have a chord. Computing the treewidth of a graph corresponds to finding a 
triangulation with the smallest cliquesize. In particular, we can restrict ourselves 
to triangulations minimal by inclusion, that we call minimal triangulations. 

Computing the treewidth of arbitrary graphs is NP-hard. Nevertheless, the 
treewidth can be computed in polynomial time for several well-known classes of 
graphs, for example the chordal bipartite graphs, the circle and circular-arc gra- 
phs, and permutation graphs. All these algorithms use the minimal separators 
of the graph and the fact that these classes of graphs have “few” minimal sepa- 
rators, in the sense that the number of the separators is polynomially bounded 
in the size of the graph. By studying the potential maximal cliques of a graph, 
which are the maximal cliques appearing in at least one minimal triangulation 
of the graph, we proved in [3,4,5] that the minimal separators are sufficient to 
compute the treewidth of a graph. 

It has been proved by Bodlaender et al [2] that the treewidth can be appro- 
ximated within a 0(logn) multiplicative factor. In addition, they showed that 
the treewidth is not approximable within an additive constant unless P = NP. 

The existence of a polynomial time approximation algorithm which is no 
more than a constant times the optimal value is a question that still remains 
open. In this paper, we consider this problem for the class of AT-free graphs. 
Computing the treewidth is still NP-hard for these graphs [1]. Using the particu- 
lar structure of their minimal separators, we give a 2-approximation algorithm 
for the treewidth of AT-free graphs. 



U. Brandes and D. Wagner (Eds.): WG 2000, LNCS 1928, pp. 59—70, 2000. 
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2 Preliminaries 

Throughout this paper we consider connected, simple, finite, undirected graphs. 
Let G = {V, E) be a graph. For a vertex set V' CV of G, we denote by Na(V') 
the neighborhood of V in G\V . 

A graph H is chordal (or triangulated) if every cycle of length at least four has 
a chord. A triangulation of a graph G = (V, E) is a chordal graph El = {V, E') 
such that E C E' . H is a, minimal triangulation if for any intermediate set E” 
with E C E" C E', the graph (V, E") is not triangulated. A clique of G is a 
complete subgraph of G. 

Definition 1. Let G he a graph. The treewidth of G, denoted by tw(G), is the 
minimum, over all triangulations H of G, of uj{H) — 1, where to{H) is the the 
maximum cliquesize of H. 

In other words, computing the treewidth of G means finding a triangulation 
with smallest cliquesize. In particular, the treewidth is always achieved by some 
minimal triangulation of the graph. 



2.1 Minimal Separators and Chordal Graphs 

The minimal separators play a crucial role in the characterization of the minimal 
triangulations of a graph. 

A subset S' C y is an a, b-separator for two nonadjacent vertices a,b € V 
if the removal of S from the graph separates a and b in different connected 
components. S is a minimal a, b-separator if no proper subset of S separates a 
and b. We say that S is a minimal separator of G if there are two vertices a 
and b such that S is a minimal a, b separator. Notice that a minimal separator 
can be strictly included into another. We denote by Aq the set of all minimal 
separators of G. 

Let G be a graph and S a vertex set of G. We note Cg{S) the set of connec- 
ted components of G\S. A component G € Cg{S) of G\S is a full component 
associated to S if every vertex of S is adjacent to some vertex in G. For the 
following lemma, we refer to [7]. 

Lemma 1. A set S of vertices of G is a minimal a, b-separator if and only if a 
and b are in different full components associated to S. 

Let S' be a minimal separator of G. If G G Cg{S), we say that (S, G) = 
S U G is a one-block associated to S. A one-block (S, G) is called full if G is a 
full component associated to S. If (S, G) is a full one-block, then S = Ng{G). 
If (S, G) is not full, then S* = Ng{G) is a minimal separator of G, strictly 
contained in S. 

Let S be a minimal separator of G. We say that S crosses a set of vertices 
A if S separates two vertices x,y £ A (i.e. S is an x, y-separator). We say that 
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S separates two sets of vertices A and B if S separates each vertex of A\S' from 
each vertex of B\S. 

Let S and T be two minimal separators. If S crosses T, we write S^T. 
Otherwise, S and T are called parallel, denoted by S\\T. It is easy to prove 
that these relations are symmetric. Remark that S and T cross if and only if 
then T intersects each full component associated to S. Conversely, S and T are 
parallel if and only if T is contained in some one-block (S', Ct) associated to S. 
In particular, if T C S, then S and T are parallel. 

Let S G Aq be a minimal separator. We denote by Gs the graph obtained 
from G by completing S, i.e. by adding an edge between every pair of non- 
adjacent vertices of S. If C C Aq is a set of separators of G, Gr is the graph 
obtained by completing all the separators of B. The results of [8], concluded 
in [10], establish a strong relation between the minimal triangulations of a graph 
and its minimal separators. 

Theorem 1. Let B G Aq be a maximal set of pairwise parallel separators of G. 
Then H = Gr is a minimal triangulation of G and Ah = B. 

Let H he a minimal triangulation of a graph G. Then Ah is a maximal set 
of pairwise parallel separators of G and H = Gah- 

In other terms, every minimal triangulation of a graph G is obtained by 
considering a maximal set B of pairwise parallel separators of G and completing 
the separators of B. The minimal separators of the triangulation are exactly the 
elements of B. 

It is important to know that the elements of B, who become the separators 
of H , have strictly the same behavior in H as in G. Indeed, the connected 
components of H\S are exactly the same as in G\S, for every S G B. Moreover, 
the full components associated to S are the same in the two graphs. 



2.2 Blocks 

The following definitions are strongly related with the blocking sets and the blocks 
introduced in [6]. 

Definition 2. Let G be a graph and S C Aq a set of pairwise parallel separators 
such that for any S G S, there is a one-block {S, G{S)) containing all the elements 
ofS. Suppose that S, ordered by inclusion, has no greatest element. We define 
the piece between the elements of S by 

P{S)= 

ses 

Notice that for any S G S the one-block of S containing all the separators of 
S is unique: if T G 5 is not included in S, there is a unique connected component 
of G\S containing T\S. 
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Definition 3. Let B he a set of vertices of a graph G. We denote by Ci, . . . ,Cp 
the connected components of G\B and by Si the neighborhood of Ci. We will say 
that B is a block of G if the sets Si are minimal separators of G and one of the 
following conditions holds: 

- B = G. 

- There is an i & [l,p] such that B is a one-block (Si,C). 

- B = P{Su...,Sp). 

If B is a block, we say that the minimal separators Si,...,Sp border B. The 
block B is full if G = B or, for each Si, B is contained in a full one-block 
associated to Si. 

Remark 1. Observe that if {Ti , T 2 , . . . Tq} is a set of minimal separators such that 
the piece between them exists, then B = P{Ti, . . . , Tp) is a block. Moreover, any 
of the minimal separators bordering B is contained in one of T\, . . . ,Tq. Also 
if i? is a one-block (S,C), then any of the minimal separators bordering B is 
contained in S. 

A block is also characterized by the following property: 

Proposition 1. Let B he a set of vertices of G. Consider the connected com- 
ponents Ci,...,Cp of G\B and their neighborhoods Si,...,Sp. If for all i, 
1 < i < p we have that Si is a minimal separator, B is contained in some 
one-block {Si,Di) and B ^ Si, then B is a block ofG. 

Proof. Suppose that the set {Si, S 2 , . . . , Sp} has no greatest element. We show 
that P{Si , . . . , Sp) exists and is equal to B. For each Si, all the minimal separa- 
tors Si, S 2 , . . . , Sp are contained in a one-block (Si,Di), so P{Si, . . . , Sp) = 
f]^{Si,Di). In particular, B C P{Si, . . . , Sp). Conversely, let us prove that 
P{Si, . . . , Sp) is contained in B. Let a; be a vertex of G\B, so x is contained 
in some Ci. Hence, x ^ (Si,Di), so a; ^ P{Si, . . . , Sp). 

Suppose now that {S'!, S' 2 , . . . , Sp} has a greatest element, say ^i. We show 
that B = (Si,Di). Clearly, B C (Si,Di). Suppose there is a vertex x in 
{Si,Di)\B and let Ci the component of G\B containing x. We have that 
Si C Si, so Ci is also a connected component of G\Si. Since x £ Di, it follows 
that Di = Ci, so Di is a connected componend of G\B. Therefore, Di fl H = 0, 
contradicting B ^ Si. □ 

Proposition 2. Let G be a graph and S,T two minimal separators of G such 
that SffT. Then S' U T is a full block of G. 

Proof. Let B = S U T. The assertion is true ii B = G. Suppose that B ^ G. 
Consider the connected components Ci , . . . , Cp of G\B and their neighborhoods 
Si,...,Sp. 

We prove first that each Si is a minimal separator of G and B is contained in a 
full one-block (Sj, Di). The vertex set Ci is contained in a connected component 
D' associated to S and in a connected component E' associated to T. Let D, E 
be two full components associated to S, respectively T, such that D ^ D' and 
E ^ E' . Let us show that DU if is contained in a same full component associated 
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to Si- Since Ci C D' and D' is a connected component of G\S, N{Ci) is contained 
in D' \J N {D') C (S,D'). It follows that Si = N{Ci) does not intersect D. Thus, 
D is contained in a connected component Cd of G\Si. For similar reasons, E 
is contained in a connected component Ce of G\Si. It remains to prove that 
Gd = Ge- Since S'#T and D is a full component associated to S, there is an 
X G T C\ D. Then x G T has at least one neighbor y in the full component 
E associated to T. We have two adjacent vertices x G Gd and y G Ge, so 
Gd = Ge- We denote Di = Gd- Since 5^ C S' U T and D, E are full components 
associated to S, respectively T, we deduce that each vertex of Si has a neighbor 
va Di Gi D iJ E. So Di is a full component associated to Si, different from Gi- 
By lemma 1, Si is a minimal separator of G- We show now that B C (Si,Di). 
Indeed, D C Di, so S = N{D) is contained in N{Di) U Di = {Si,Di). In the 
same way, T C (Si,Di). We conclude that B is contained in the full one-block 
(Si,Di). Also notice that fl B yf 0, because Di contains D and D C\T yf 0. 
Thus, B yf S,. 

By proposition 1, i? is a full block of G. □ 

Proposition 3. Let B he a block of G and S C B be a minimal separator of 
G- For any connected component G associated to S which intersects B we have 
that {S,G) C\B is a block- In other words, if S C B crosses B, then S splits the 
block B into smaller blocks- 

We omit here the proof of proposition 3. If i? is a block S U T with S#T, 
then any full component associated to S intersects T. We obtain: 

Corollary 1. Let SffT he two minimal separators ofG- Let G be a full compo- 
nent associated to S- Then S U {T D C) is a block of G- 

3 AT-Pree Graphs and Minimal Separators 

We say that three vertices (x, y, z) of a graph form an asteroidal triple of a graph 
G if between every two of them there exists a path avoiding the neighborhood 
of the third. A graph is AT-free if it has no asteroidal triple. 

For characterizing AT-free graphs we rather use the notion of asteroidal triple 
of separators- 

Definition 4. Let S, T and U be three minimal separators of G, none of them 
being contained into another- We say that {S,T, U) form an asteroidal triple of 
separators if P{S,T,U) exists- 

Proposition 4 ([6]). A graph G is AT-free if and only if it has no asteroidal 
triple of separators- 

In particular, an AT-free graph cannot have three-blocks, that is blocks of 
type P{S, T, U) with none of S, T, U contained into another. So any block B of 
an AT-free graph is a one-block (S', G), a two-block P{S, T) or the graph G itself. 
Let us reconsider corollary 1 for the case of AT-free graphs. 
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Proposition 5. Consider an AT-free graph G, a minimal separator S of G and 
a full component G associated to S. Let T he a minimal separator crossing S. 
Suppose that G\T ^ 0 and let U = N{G\T). Then S A {T C\ G) = P{S, U). 

Proof. By corollary 1, B = S A (T nC) is a block of G. 

Let G\,G 2 , ■ ■ ■ ,Gp be the connected components of G\B. The Gfs are either 
connected components of G\S different from G or connected components of 
G[C'\r]. Let Si, . . . , Sp he the neighborhoods of Gi, . . . , Gp. Notice that, if Gj is 
a connected component of G\S different from G, then Si C S. In particular, if 
Gi is a full component associated to S with Gi yf G, we have Si = S. 

Let us show that S is maximal by inclusion in {S'!, . . . , S'p}. Suppose that S is 
strictly contained in some Si, so S C N{Ci). Then Gi is a connected component 
of G[G\T]. Let D be the connected component of G\T containing Gi. We have 
N{Gi) C DA N{D) C (T,D), so S' C (T,D). This contradicts the fact that S 
and T cross. 

Let Gi be any connected component of G[G\T] (such a Gj exists because C\T 
is non-empty). Let us show that Si % S. If Si C S, then Gi is also a connected 
component of G\S, so Gi = G. This implies i? fl G = 0, contradicting the fact 
that T intersects the full component G associated to S. It follows that Si % S. 

Thus, the set of minimal separators hording B has at least two elements 
maximal by inclusion, S and S' . Since an AT-free graph has only one-blocks and 
two-blocks, we deduce B = P{S,S'). It remains to show that S' = U. For any 
connected component Gi of G[G\T], we have Si % S, so Si C S' by remark I. 
It follows that S' contains U = N{C\T). It is easy to see that any minimal 
separator bordering B is contained in S or in N{G\T), so S' C U. We conclude 
that [/ is a minimal separator of G and S' U (T fl G) = P{S, U). □ 

Notice that the minimal separator U of proposition 5 is also one of the 
minimal separators bordering the block S U T. In particular, U is parallel to 
both S and T. Since U intersects at most one component associated to T, there 
is at least one full component E associated to T such that U fl if = 0. 

Corollary 2. Let S and T he two crossing separators of an AT-free graph G. 
Let G he a full component associated to S such that G\T ^ 0. Consider the 
block S A {T A C) = P{S, U). Let E he a full component associated to T such 
that E nU = 0. Then T fl G zs contained in the neighborhood of S A E. 

Proof We denote Ec = E AC. Let us prove that Ec = 0. Suppose that Ec is 
non-empty. The neighborhood of Eq is contained in S U T. We show first that 
N{Ec) is not contained in T. If N{Ec) Q T, then Ec is a connected component 
of G\T, so we must have Eq = E. But Ec C G, so the neighborhood of Ec is 
included in G U N{G) = {S, C). Let now ii be a full component associated to S, 
different from G and let y G T A D. We have y ^ N{Ec), contradicting the fact 
that Ec = if is a full component associated to T. Thus, N{Ec) is not contained 
in T, so there is a vertex y in N{Ec)\T. Clearly, y G E. By proposition 5, 
U = N{G\T), so y G U. This contradicts the fact that E does not intersect U. 

We have proved that if fl G = 0. Let now x be any vertex of Tc = T fl G, 
we show that x has a neighbor in Se = S AE. if is a full component associated 
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to T, so X has a neighbor x' in E. Since x G C, we have that x' is in C or in 
N{C) = S. But n C = 0, so the only choice remaining is x' € S D E. □ 

Proposition 6. Let S he a minimal separator of the AT-free graph G and let C 
be a full eomponent associated to S. Consider any minimal separator T, crossing 
S. Let x,y G C\T such that x and y have neighbors in S\T. Then x and y are 
in a same connected component of G\T. 

Proof. By proposition 5, we know that U = N(G\T) is a minimal separator and 
the block = S' U (T n C) can be written P(S, U). Recall that T and U are 
parallel. Suppose that T separates x and y and let x',y' G S\T be two neighbors 
of X, respectively y. Clearly, T separates x' and y' . But x' ,y' G N{G\T) = U, 
so T crosses U - contradiction. □ 

4 The Algorithm 

We begin by an informal description of our algorithm. We are given an AT-free 
graph G and an integer k and we want to decide if tw(G) > k or to find a 
triangulation of width at most 2k. The time complexity of the algorithm we 
obtain is polynomial and does not depend on the value of k, so it is easy to 
derive a polynomial algorithm for a 2-approximation of the treewidth of AT-free 
graphs. 

The algorithm relies on this property of AT-free graphs which will be proved 
in the next section. Given an AT-free graph G of treewidth at most k and a 
one-block (S,G) with |S| < fc and \G\ > k + 1, there is a minimal separator 
Lf C {S,C) such that \P{S,U)\ <2k+l and \U\ < k. Moreover, the separator 
U can be computed in polynomial time. 

Consider any minimal separator S' of G and any connected component G 
of G\S. We denote by R{S,C) the realization of (S, G), which is the graph 
G 5 [SUG] (see [9]). We have to show that if |S| < fc then either we can compute 
a triangulation H{S,G) of R{S,G) of width at most 2k or we can decide that 
tw(G) > k. Indeed, a triangulation of G is obtained by taking IJ^ iJ(S, G) [9]. 

The algorithm works in this way. If |G| < A: -I- 1, we simply complete (S, G) 
into a clique. If |G| > A: -I- 1, we are looking for a minimal separator U C (S, G) 
such that \U\ < k and \P{S,U)\ < 2k + 1. If such a separator does not exist 
we assert that tw(G) > k. If U exists, observe that U splits the block (S, G) 
into smaller blocks, namely the block P{S, U) and blocks of the form ({7, G') 
with G' C G. We then apply this process recursively on each sub-block ({7, G') 
to find a triangulation H{U, C') of R{U, G') of width at most 2k or output that 
tw(G) > k. The block P{S, U) will be simply completed into a clique K{S, U). 
Then H{S, C) is |Jc' H{U, C') U K{S, U). 

Finding the first separator S with lAj < k is easy, it is sufficient to look for a 
separator of minimum size. It is well-known that this computation can be done 
in polynomial time. 

More delicate is the search for the minimal separator U in the general fra- 
mework and we use a quite indirect method. We will consider the co-bipartite 
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graph G' = G{s_c’}[5' U C] obtained from G[S' U G] by completing S and G. We 
show that if tw(G) < k, there is a minimal separator X of G' of size at most k. 
Conversely, we prove that if G' has a minimal separator X with \X\ < k, the 
vertex set S' U X is actually a block P(S, U) of G, with U = Ng{G\X). The 
minimal separator U will satisfy \P{S, U)\ <2k+ 1 and \U\ < k. 

We give now the pseudo-code of the algorithm. The main program, approxi- 
mate_triangulation, uses the procedure triangulatejrealization which, gi- 
ven G, k and a one-block (S, G) computes a triangulation of R{S, G) of width 
at most 2k or outputs that tw(G) > k. 

approximate triangulation 

Input : an AT-free graph G and an integer k 

Output : tw(G) > fc or a triangulation H of width at most 2k 

begin 

compute a minimal separator S of G of minimum size 
if |S| > k then 

return “tw(G) > fc” 

for all one-blocks (S, G) associated to S do 

H{S, G) ^ triangulate_realization(G, S, G, k) 
if the call outputs “tw(G) > fc” then 
return “tw(G) > /c” 
return |J(^ H{S, C) 

end 

triangulate realization 
Input : G, S, G and k 

Output : tw(G) > fc or a triangulation H{S, C) of i?(S, G) of width < 2k 
begin 

if |S U G| < 2/c -|- 1 then 

return the clique on vertex set S U G 
/* we have to find the separator U*/ 

G' ^G[s,c}[SuC] 

compute a minimum size separator X of G' 
if |A| > k then 

/* we still have the possibility that tw (i?(S, G)) < k */ 
return approximate triangulation(i?(S, G), k) 

U ^ Ng{G\X) 

for all the one-blocks {U,G') contained in (S,G) do 
H'{U, G') ^ triangulate _realization(G, U, G' , k) 
if the call outputs “tw(G) > fc” then 
return “tw(G) > A:” 
return |J(^, H'{U, G') U Gp(^s,u) ['S' U U] 

end 

Our goal is to implement the procedure triangulate_realization in polyno- 
mial time. It remains to discuss a point we have hidden until now. It is possible 
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that the algorithm fails to find the separator U even if tw(G) < k. This case oc- 
curs only when the minimal separator S belongs to all the minimal triangulations 
of width k. So to handle this problem, we have just to decide if tw{R{S, C)) > k 
or to find a triangulation of R{S,C) of width at most 2k, that is we have to 
apply the main algorithm on the realization of the block. 

5 Main Theorem 

In this section we consider an AT-free graph G of treewidth at most k. We 
take any minimal triangulation H of G, of width at most k (so oj{H) < A: -I- 1). 
Consider now any minimal separator S' of G and a full component G associated 
to S. Suppose that S is not a minimal separator of H and that |G| > fc -I- 1. 

Under these conditions, our main result is the following: 

Theorem 2. There is a minimal separator U C (S,G) of G such that \U\ < k 
and \P{S,U)\ < |S| + k. Moreover, the minimal separator U can he computed in 
polynomial time. 

The proof of this theorem will be made in several steps. We will first concen- 
trate on the existence of the minimal separator U. Our proofs will characterize U 
and will give some further information about the block P{S, U). Then we obtain 
an algorithm computing U . 

Since S is not a minimal separator of H and Ah is a maximal set of pairwise 
parallel separators of G by theorem 1, we have at least one minimal separator 
T G Ah that crosses S. Take T among all the minimal separators crossing S such 
that T n G is maximal by inclusion. Observe that G\T yf 0, because |G| > fc -|- 1 
and \T\ < k. By proposition 5, i? is a block of G and there is some U C B such 
that B = P{S, U). We denote A = [/ U (T n G). We distinguish the “easy” case 
when |A| < k, and the “difficult” case, when |A| > k. 



5.1 Easy Case: |X| < fc 

Proposition 7. Consider the co-bipartite graph G' = G{s^c'}[S'U G] obtained 
from G[S'UG] by completing the sets S and C into cliques. Then X is a minimal 
separator of G' . 

Proof. We show that C\X and S\X are full components associated to X in G'. 

Observe that C\X yf 0. Indeed, |A| < k, and |G| > k + 1, so C\X yf 0. We 
prove that Nc'{C\X) = X. By proposition 5, the neighborhood of G\T in G is 
U. Clearly, the neighborhood of C\T in G' is (T O G) U C/ = X, so C\T = C\X 
is a full component associated to A in G'. 

Since S' is a clique of G', S\A is contained in a connected component D of 
G'\A. This component is different from G\A, and since U(G') = (S\A) U A U 
(G\A) the only choice is D = S\X. Since T and U are parallel in G, there is a 
full component E associated to T in G such that EC\U = 0. Notice that S\X 
is not empty, in particular it contains Se = S C\ E. Let us show that A is in 
the neighborhood of S\X in G'. According to corollary 2, T fl G is contained 
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in the neighborhood of Se in the graph G, so X\S = T D C is contained in 
Ng'{S\X). Also A n S' is contained in Ng'{S\X), because S is a clique in G'. 
Hence, Ng'{S\X) = X. We conclude that S\X is a full component associated 
to X in G'. 

We have proved that G'\X has two full components associated to X. By 
lemma 1, A is a minimal separator of G". □ 

So there is a minimal separator A of G" of size at most k. Conversely, we 
show that if G" has a minimal separator Y of size at most k, we will be able to 
compute a minimal separator U like in theorem 2. 

Proposition 8. Let Y he a minimal separator of the graph G' = G'{g_c'} [S, G'] . 
Then SUA is a two-block P{S,U) of G. Moreover, U = Ng{G\Y) and U is 
contained in Y . 

Proof. We denote B = S U A. We show that B is not contained in S or in 
A. Since A is a minimal separator of the cobipartite graph G' , the connected 
components of G'\Y are exactly S\Y and C'\A. In particular, both of them are 
non empty, so S Y. Also, Y S. Indeed, if A C S', let x be a vertex of 
S\A and let t/ G C be a neighbor of x in the graph G. Clearly, x and y are 
also adjacent in G", contradicting the fact that S\A and G'\A are in different 
connected components of G'\Y . Thus, B ^ S and B ^Y. 

Let Gi,. . . ,Gp be the connected components of G\B and let Si , . . . , Sp be 
their neighborhoods in G. The Gfs are either connected components of G\S 
different from G or connected components of G'[G'\A]. In particular, if Ci is a 
full component associated to S, different from C, then Si = S. 

We will see that each Si is a minimal separator of G and that B is contai- 
ned in some full block (Si,Di). Consider first the case when Ci is a connected 
component of G\S, different from C. Then Si C S. Clearly, G* fl Sj = 0, so C 
is contained in some connected component Di of G\Si, with Di ^ Gi. Since 
each vertex of S has a neighbor in G, it follows that every vertex of Si has a 
neighbor in Di. Thus, Di is a full component associated to Si. Both Gi and Di 
are full components associated to Si, so Si is a minimal separator by lemma 1. 
Since G' C Di,we have that S = Ng{C) is contained in Di U NciDi) = (Si,Di). 
We conclude that B C (S,G) C (Si,Di). Consider now the case when Gi is a 
connected component of G'[C\A]. Notice that Si is contained in A. Let D be a 
full component associated to S, different from G. Since Si C (S', G), Si does not 
intersect D, so D is contained in a connected component Di associated to Si. 
Let us prove first that S C SiU Di. Indeed, S = Ng{D), so S is contained in 
Di U NciDi) C Si U Di. We show now that A\S is also contained in Si U Di. 
In the graph G", the vertex set S\A is a full component associated to A, so any 
vertex x G Y has a neighbor y G S\Y. In particular, if x G A\S, then y is also 
a neighbor of x in the graph G (the edges between S\A and A\S are the same 
in G and in G"). Thus, A\S is contained in Ng{S\Y). Since Si C Y we have 
S\A C S\S, C Di, so Y\S CDiU Ng{D,) C S, U A- Therefore, B C S,U D,. 
Observe now that Di is a full component associated to Si. Indeed, each vertex 
of Si n S has a neighbor in D, so in A, and each vertex of Si\S C A\S has a 
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neighbor in S\Y, so in Di. Thus, Di is a full component associated to Si. Clearly 
Di ^ Si, so by lemma 1, is a minimal separator. 

We have proved that each Si is a minimal separator of G and that B is 
contained in some full one-block (Si,Di). Moreover B is not contained in any 
Si', recall that each Si is contained in S or in Y, but B Y and B S, so 
B Si- By proposition 1, B is a block of G. 

S is one of the minimal separators bordering B. We show now that S is 
maximal by inclusion among {Si, , Sp}. Any minimal separator Si is contained 
in S or in Y. Since S ^Y, S is inclusion maximal among the minimal separators 
bordering B. We want to prove that S is not the only inclusion-maximal element 
of {S'!, . . . , Sp}. Let Gi be any connected component of G[C\y]. We show that 
Si 2 S. Indeed, if Si C S, then Gi is also a connected component of G\S, 
so the only choice is Gi = G. This implies M fl G = 0, contradicting Y % S. 
The set of minimal separators bordering B has at least two inclusion maximal 
elements, S and S' . The AT-free graph G has only one-blocks and two-blocks, 
so i? is a two-block P{S,S'). It remains to prove that S' = Nc{G\Y). Since 
S' 2 S, S' is the neighborhood of a connected component Gj of G[G\T]. Thus, 
S' C Nc{G\Y). For any connected component Gi C G\F, we have Si % S, 
so Si C S' by remark I. Consequently, S' contains Na{G\Y). We conclude 
that S' = Ng{G\Y) = U. Thus, B = P{S,U). Moreover, U = Ng{G\Y) C 
N'q{G\Y), so U CY. □ 

Corollary 3. Let G he an AT-free graph and let {S,C) be a full block of G. We 
denote by G' the eobipartite graph G{s_c} [S' U G]. If G' has a minimal separator 
of size at most k, then we can compute in polynomial time a minimal separator 
U of G such that P{S, U) < l^l -I- fc and \U\ < k. 

Proof. We compute a minimum size separator Y of GL This can be done in 
polynomial time. Clearly, M is a minimal separator of G' and |M| < k. According 
to proposition 8, the vertex set S' U F is a two-block P{S, U) of G. Therefore, 
\P{S,U)\ < |S| -I- |F| < |S| -I- k. By proposition 8, U C Y so \U\ < k. Moreover, 
U = Ng{G\Y), so U can be computed in polynomial time. □ 

5.2 Difficult Case: |X| > k 

Let us consider now the case when X = {T D G) U U has strictly more that k 
vertices. Our goal is to show that also in this case there is a minimal separator 
Y of G' = G{s_c}[ 5' U G] such that |F| < k. Thus, the corollary 3 will allow us 
to compute the minimal separator U like in theorem 2. 

Due to space restriction, we only give here an outline of the proof. A complete 
description is given in the full version of this paper. 

The construction of the vertex set Y is rather technical and will be made in 
several steps. We will show that there is a minimal separator T' of H such that 
T' crosses X. We prove then that P{T,T') exists. We will consider the block 
B = P(T, T') and we put F = (B n G) U Ng{G\B). 

The first crucial observation is that F is strictly contained in the block B. 
Moreover, F is a minimal separator of the co-bipartite graph G' = G{5^c}[>5'UG]. 
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The second step is to prove that |T | < k. This is true if our block i? is a clique 
of H, because the cliques of H have at most k+1 vertices. If B is not a clique, 
we can split it into smaller blocks, and we show that Y is strictly contained in 
one of these smaller blocks. We reiterate this process until we obtain a block B' 
that can not be splitted. This implies that B' is a clique, and since Y C B we 
conclude that |y| < k. 

Using corollary 3, we deduce that in this case we can also compute the mi- 
nimal separator U such that \U\ < k and \P{S,U)\ < |S'|-|-A:, which proves the 
theorem 2. 
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Abstract. Two graphs G and H with the same vertex set V are Pi- 
isomorphic if every four vertices {a, b, c,d}(-V induce a chordless path 
(denoted by P 4 ) in G if and only if they induce a P4 in H. We call a graph 
split-perfect if it is P 4 -isomorphic to a split graph (i.e. a graph being par- 
titionable into a clique and a stable set). This paper characterizes the 
new class of split-perfect graphs using the concepts of homogeneous sets 
and p-connected graphs, and leads to a linear time recognition algorithm 
for split-perfect graphs, as well as linear time algorithms for classical 
optimization problems on split-perfect graphs based on the primeval de- 
composition of graphs. These results considerably extend previous ones 
on smaller classes such as P 4 -sparse graphs, P 4 -lite graphs, P 4 -laden gra- 
phs, and (7,3)-graphs. Moreover, split-perfect graphs form a new subclass 
of brittle graphs containing the superbrittle graphs for which a new cha- 
racterization is obtained leading to linear time recognition. 



1 Introduction 

Graph decomposition is a powerful tool in designing efficient algorithms for ba- 
sic algorithmic graph problems such as maximum independent set, minimum 
coloring and many others. Recently, the modular, the primeval and the homo- 
geneous decomposition of graphs attracted much attention. The last two types 
of decomposition were introduced by Jamison and Olariu [34] and are based on 
their Structure Theorem and the concept of P4-connectedness. A P4 is an indu- 
ced path on four vertices. A graph G = (V, E) is P4-connected (p-connected for 
short) if for every partition Vi, V2 of V with nonempty Vi,V2, there is a P4 of 
G with vertices in Vi and in V2- 

We follow this line of research by introducing and characterizing a new class 
of graphs - the split-perfect graphs - for which the p-connected components have 
a simple structure generalizing split graphs. As usual, a graph is called a split 
graph if its vertex set can be partitioned into a clique and a stable set. 

The p-connected components represent the nontrivial leaves in the primeval 
decomposition tree, and thus some basic algorithmic problems can be solved in 
linear time along the primeval decomposition tree. 

The primeval tree is a generalization of the cotree representing the structure 
of the well-known cographs i.e. the graphs containing no induced P4. A cograph 
or its complement is disconnected, and the cotree expresses this in terms of 
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corresponding co-join and join operations. The cotree representation of a cograph 
is essential in solving various NP-hard problems efficiently for these graphs; see 
[14,15] for more information on P 4 -free graphs. 

Natural extensions of P 4 -free graphs are graphs with few P4S, such as 

1. Pi-reducihle graphs [29,32] (no vertex belongs to more than one P 4 ), 

2. P 4 ~sparse graphs [25,31,33,36] (no set of five vertices induces more than one 

^^4), 

3. P 4 -lite graphs [30] (every set of at most six vertices induces at most two P 4 ’s 

or a “spider”), and 

4. P 4 ~laden graphs [22] (every set of at most six vertices induces at most two 

Pi’s or a split graph). 

Note that in this order, every graph class mentioned in this paragraph is a 
subclass of the next one. 

Recently, Babel and Olariu [3] considered graphs in which no set of at most 
q vertices induces more than t P 4 ’s, called {q,t)~ graphs. The most interesting 
case is t = q — 4: (4, 0)-graphs are exactly the P 4 -free graphs, (5, l)-graphs 
are exactly the P 4 -sparse graphs, and it turns out that P 4 -lite graphs form a 
subclass of (7, 3)-graphs. For all these graphs, nice structural results lead to 
efficient solutions for classical NP-hard problems. Our new class of split-perfect 
graphs extends all of them. 

Another motivation for studying graph classes with special P 4 structure stems 
from the greedy eoloring heuristic: Define a linear order < on the vertex set, and 
then always color the vertices along this order with the smallest available color. 
Chvatal [13] called < a perfect order of G if, for each induced subgraph H of G, 
the greedy heuristic colors PI optimally. Graphs having a perfect order are called 
perfectly orderahle (see [26] for a comprehensive survey); they are NP-hard to 
recognize [39] . Because of the importance of perfectly orderable graphs, however, 
it is natural to study subclasses of such graphs which can be recognized efficiently. 
Such a class was suggested by Chvatal in [12]; he called a graph G brittle if each 
induced subgraph P[ of G contains a vertex that is not an endpoint of any P 4 
in P[ or not a midpoint of any P 4 in P[. Brittle graphs are discussed in [27,42, 
43]. Babel and Olariu [3] proved that (7, 3)-graphs are brittle, and Giakoumakis 
[22] proved that Rj-laden graphs are brittle. A natural subclass of brittle graphs, 
called superhrittle, consists of those graphs G in which every vertex is not an 
endpoint of any P 4 in G or not a midpoint of any P 4 in G. Split graphs are 
superbrittle since in a split graph with clique G and stable set S, every midpoint 
of a P 4 is in C and every endpoint of a P 4 is in S. Superbrittle graphs are 
characterized in terms of forbidden induced subgraphs in [40] . We will show that 
our new class of split-perfect graphs is a subclass of brittle graphs, containing 
all superbrittle graphs. Moreover, we construct a perfect order of a split-perfect 
graph efficiently, and we obtain a new characterization of superbrittle graphs 
leading to a linear time recognition. 

Yet another motivation for studying split-perfect graphs stems from the 
theory of perfect graphs. A graph G is called perfect if, for each induced subgraph 
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H of G, the chromatic number of H equals the maximum number of pairwise 
adjacent vertices in H . For example, all the above-mentioned graphs are perfect. 
For more information on perfect graphs, see [6,11,23]. Recognizing perfect gra- 
phs in polynomial time is a major open problem in algorithmic graph theory. 
Two graphs G and H with the same vertex set V are P^-isomorphic if, for all 
subsets S C V, S induces a P4 in G if and only if S induces a P4 in H. Chvatal 
[18] conjectured and Reed [41] proved that two P4-isomorphic graphs are both 
perfect or both are imperfect. Thus, to recognize perfect graphs it is enough 
to recognize the P4-structure of perfect graphs: Given a 4-uniform hypergraph 
H = (V,£). Is there a perfect graph G = (U, E) such that S' G f if and only if 
S induces a P4 in G? This was done for the case when the perfect graph G is 
a tree [20,9,10], a block graph [7], the line graph of a bipartite graph [44] or a 
bipartite graph [2]. 

Another question arising from Reed’s theorem is the following: Which (per- 
fect) graphs are P4-isomorphic to a member of a given class of perfect graphs? 
Let C be a class of perfect graphs. Graphs P4-isomorphic to a member in C are 
called C -perfect graphs. By Reed’s theorem, C-perfect graphs are perfect. Moreo- 
ver, they form a class of graphs which is closed under complementation and 
contains C as a subclass. Thus, it is interesting to ask the following question: 
Assume that there is a polynomial time algorithm for testing membership in C. 
Gan C-perfect graphs be recognized in polynomial time, too? First results in this 
direction are good characterizations of tree-perfect graphs, forest-perfect graphs 
[8] , and bipartite-perfect graphs [35] . This paper will give a good characterization 
of split-perfect graphs. 

Definition 1. A graph is called split-perfect if it is PA-isomorphic to a split 
graph. 

Trivial examples of split-perfect graphs are split graphs and P4-free graphs. 
Nontrivial examples are induced paths P„ = V 1 V 2 ■ ■ ■ Vn for any integer n. To see 
this we need some definitions, following [8]. Let (ui, . . . , u„) be a vertex order of 
a graph G. Then N^i{vi) denotes the set of all neighbors Vk of Vi with k > i. 




Fig. 1: Elementary graphs illustrated 
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A vertex order (wi, . . . , w„) of G is said to be elementary if for all i: 






{vi+ 2 , ^'^+ 3 , ■ • ■ , w„} for even i 
{wi+i} for odd i 




Graphs having elementary orders are split graphs in which the “odd vertices” 
V 2 k+i form a stable set and the “even vertices” V 2 k form a clique. A graph is 
said to be elementary if it has an elementary order. If the elementary graph has 
at least 4 vertices then its partition into a clique and a stable set is unique and 
can be determined using its degree sequence. Thus, as split graphs in general 
[24], elementary graphs can be recognized in linear time. 

Obviously, = v\V 2 ■ ■ ■ Vn is P 4 -isomorphic to the elementary graph con- 
sisting of the elementary order (ui,... ,u„). It can be seen that, for n > 7, 
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this elementary graph is the only split graph (up to “complementation” and 
“bipartite complementation”) that is P4-isomorphic to In Section 4 we will 
extend this example to the so-called double-split graphs which play a key role 
for characterizing split-perfect graphs. 

In section 2 we will show that the class of split-perfect graphs contains all P4- 
laden graphs and all (7,3)-graphs (hence all P4-reducible, Pj-sparse and P4-lite 
graphs). The relationship between the above-mentioned graph classes is shown 
in Figure 2. 

In section 3, we describe forbidden induced subgraphs of split-perfect graphs 
which are needed to characterize split-perfect graphs. 

In section 4, we introduce double-split graphs and show that they are split- 
perfect. As already mentioned, double-split graphs are of crucial importance for 
a good characterization of split-perfect graphs. 

In section 5, we characterize split-perfect graphs in terms of forbidden sub- 
graphs and in terms of their p-connected components: It turns out that for 
split-perfect graphs without homogeneous sets, the p-connected components are 
double-split graphs or their complements. 

In the last section, we will point out how classical optimization problems 
such as maximum clique, minimum coloring, maximum independent set, and 
minimum clique cover can be solved efficiently, in a divide and conquer manner, 
on split-perfect graphs using the primeval decomposition tree. These results are 
based on our good characterization of p-connected split-perfect graphs. 

Due to the space limitation for this extended abstract, we omit all proofs; 
some of them are quite long and technically involved. The interested reader will 
find them in the full version of our paper. 

2 Preliminaries 

Our notions are quite standard. The neighborhood of the vertex u in a graph 
G is denoted by Nq{v); if the context is clear, we simply write N{v). The path 
(respectively, cycle) on m vertices vi, V2, ■ ■ ■ , Vm with edges ViVi+i (respectively, 
ViVi+i and viVm) (1 < t < to) is denoted by Pm = V1V2 ■ ■ ■ Vm (respectively. 
Cm = V1V2 ■ ■ ■ VmVi)- The vertices vi and Vm are the endpoints of the path Pm, 
and for a P4 W1U2U3U4, V2 and U3 are the midpoints of the P4. Graphs containing 
no induced subgraphs isomorphic to a given graph H are called H-free graphs. 
It is well-known that split graphs are exactly the (C4, C4, Cs)-free graphs [21]. 

For convenience, for S C V{G), let S denote the subgraph G[S'j induced by 
S. 

A set S of at least two vertices of a graph G is called homogeneous, if 
S yf V{G) and every vertex outside S is adjacent to all vertices in S or to 
no vertex in A. A homogeneous set M is maximal if no other homogeneous set 
properly contains M . It is well-known that in a connected graph G with connec- 
ted complement G, every two different maximal homogeneous sets are disjoint 
(see, e.g. [37]). In this case, the graph G* obtained from G by contracting every 
maximal homogeneous set to a single vertex is called the characteristic graph 
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of G. Clearly, G* is connected and has no homogeneous set. We shall use the 
following useful fact for later discussions (see Figure 3 for the graphs Gi). 

Lemma 1 ([28]). Every graph containing an induced G^ and no homogeneous 
set contains an induced P5 or G3 or G4. 

Throughout this paper, we use the following basic property: every homoge- 
neous set contains at most one vertex of every P4. 

Following Jamison and Olariu [34], a graph is called P^-connected (or p- 
connected) if, for every partition of its vertex set into two nonempty disjoint 
parts, there exists a P4 containing vertices from both parts. It is easy to see that 
every graph has a unique partition into maximal induced p-connected subgraphs 
(called p-connected components) and vertices belonging to no P4. For the subse- 
quent Structure Theorem of Jamison and Olariu we need the following notion: 
A p-component H of G is called separable if it has a partition into nonempty 
sets Hi, H2 such that every P4 with vertices from both Hi's has its midpoints in 
Hi and its endpoints in H2 ■ Note that a p-connected graph is separable if and 
only if its characteristic graph is a split graph [34] . 

Theorem 1 (Structure Theorem, [34]). For an arbitrary graph G, precisely 
one of the following conditions is satisfied: 

(i) G is disconnected 

(ii) G is disconnected 

(iii) G is p-connected 

(iv) There is a unique proper separable p-component H of G with a partition 
{Hi,H 2) such that every vertex outside H is adjacent to all vertices in Hi 
and misses all vertices in H2- 

Based on this theorem, Jamison and Olariu define the primeval decomposi- 
tion which can be described by the primeval decomposition tree and leads to 
efficient algorithms for a variety of problems if the p-connected components are 
sufficiently simple. We will show that this is the case for split-perfect graphs. 

Note that dividing a graph into p-connected components can be done in 
linear time (see [5]). This fact and the subsequent Proposition 1 below allows us 
to restrict our attention to p-connected split-perfect graphs only. 

Proposition 1. A graph is split-perfect if and only if each of its p-connected 
components is split-perfect. 

Observation 1. Let G be split-perfect and let H = {Gh, Sh, Eh) be a split 
graph P4-isomorphic to G. Assume that each of the sets {a, b, c, m} and {a, b, c, w} 
induces a P4 in G. Then exactly one of the following conditions holds: 

(i) a, b, c induce a path P3 in H , and u and v are both adjacent in H to an 
endpoint of the path H[a,b,c]. In particular, u and v both belong to the 
stable-part Sh of H . 

(ii) The statement (i) holds in H instead of H . In particular, u and v both 
belong to the clique-part Gh of H. 
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Proposition 2. Let G be a p-connected split-perfect graph. Then every homo- 
geneous set of G induces a P^-free graph. 

Proposition 3. Let G be a p-connected graph. G is split-perfect if and only if 

(i) every homogeneous set of G induces a P^-free graph, and 

(ii) G* is split-perfect. 

Propositions 1 and 3 allow us to consider only p-connected split-perfect graphs 
without homogeneous sets. 

Corollary 1. (i) P^-laden graphs are split-perfect. 

(ii) {7, 3) -graphs are split-perfect. 

3 Forbidden Induced Subgraphs for Split-Perfect Graphs 

As a consequence of Observation 1, we give a list of forbidden induced subgraphs 
of split-perfect graphs: These are the induced cycles Ck of length k > 5, the 
graphs Gi (1 < t < 8) shown in Figure 3, and their complements. It turns out 
(Theorem 2) that these forbidden induced graphs also characterize split-perfect 
graphs that have no homogeneous set. 

We need some notions. Let G and G' be two graphs with the same vertex 
set. An induced P 4 in G is bad if its vertices do not induce a P4 in G' (thus, 
P4-isomorphic graphs do not have bad P4S.). Another useful notion is suggested 
by Observation 1: Let G be a split-perfect graph and H a corresponding split 
graph having the same P4-structure. We call the clique and the stable set of H 
the two classes of H. Two vertices x,y va G are called equivalent (x ~ y) if they 
are in the same class of H. 

Now, Observation 1 means that in a split-perfect graph G, vertices x and 
y with the property that there are vertices a,b,c € V (G) — {u, u} such that 
{a, b, c, a:} and {a, b, c, y} both induce a P 4 are in the same class i.e. x ^ y. 
Therefore, a set of pairwise equivalent vertices induces a P4-free graph in G. 

Lemma 2. No split-perfect graph has a Gk, Gk {k > 5), Gj, Gi (1 < t < 8) as 
an induced subgraph. 



4 Double-Split Graphs 

We define now the class of double-split graphs generalizing the split graphs and 
playing a key role in the subsequent characterization of split-perfect graphs. As 
an important step towards this characterization, we will show that double-split 
graphs are split-perfect. 

Definition 2. A graph is called double-split if it can be obtained from two dis- 
joint (possibly empty) split graphs Gl = {Ql, Sl, El), Gr = (Qr,Sr,Er) and 
an induced path P = P[xl,xr], possibly empty, by adding all edges between xr 
and vertices in Qr and all edges between xr and vertices in Qr. 
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Fig. 3: Forbidden induced subgraphs 
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Fig. 4: Double-split graphs illustrated 



Remark: Every split graph is double-split as the case of an empty path P and 
an empty split graph G/j shows. 

Lemma 3. Double-split graphs are split-perfect. 

Double-split graphs and their complements can be recognized in linear time 
due to their simple structure using techniques from [19] . 

5 The Structure of Split-Perfect Graphs 

We now are able to describe split-perfect graphs without homogeneous sets. They 
can be characterized as follows. 

Theorem 2. Let G be a graph having no homogeneous set. Then the following 
statements are equivalent: 

(i) G is split-perfeet; 

(ii) G has no induced subgraphs Gk, Ck {k > 5), Gi, Gi (1 < i < 8); 

(iii) G or G is a double-split graph. 

Theorem 2 and Propositions 1 and 3 immediately yield: 

Theorem 3. A graph G is split-perfect if and only if each of its p-connected 
component H has the following properties: Every homogeneous set in H induces 
a Pi-free graph, and H* is a double-split graph or the complement of a double- 
split graph. 

Corollary 2. Split-perfect graphs can be recognized in linear time. 

This follows from Theorem 3, and the following tasks can be carried out 
in linear time: finding the p-connected components of a graph [5], finding all 
maximal homogeneous sets of a (p-connected) graph [37,38] , recognizing P 4 -free 
graphs [16], and recognizing double-split graphs and their complements. 
Theorem 4. A graph G is superbrittle if and only if for each of its p-connected 
components H of G, 

(i) the homogeneous sets of H are cographs and 

(ii) the characteristic graph H* is a split graph. 
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Corollary 3. Superbrittle graphs are split-perfect and can be recognized in linear 
time. 

Corollary 4. Split-perfect graphs are brittle. Moreover, a perfect order of a 
split-perfect graph can be constructed efficiently. 



6 Optimization in Split-Perfect Graphs 

As already mentioned, Theorem 1 implies a decomposition scheme, called prime- 
val decomposition, for arbitrary graphs. The corresponding tree representation, 
called primeval tree, has the p-connected components and vertices not belonging 
to any P 4 of the considered graph as its leaves. The important features of the 
primeval tree of a given graph G are: 

1. If an optimization problem such as maximum clique, minimum coloring, ma- 
ximum independent set, and minimum clique cover can be solved efficiently 
on the p-connected components of G, then one can also efficiently solve the 
problem on the whole graph G; see for example [1]. 

2. The primeval tree can be constructed in linear time; see [5]. 

Based on these facts, linear time or at least polynomial time algorithms 
have been found for classical NP-hard problems on many graph classes such as 
{q, q — 4)-graphs and various subclasses. 

We now point out, how to compute the maximum clique size o’(G) for p- 
connected split-perfect graphs G. First, let be a homogeneous set in G. Then 

w(G) = max(a;(i7) -I- oj{N{H)),uj{GI H)), 

where G/H is the graph obtained from G by contracting H to a, single vertex. 
Thus, if oj{G*) and u>{H) for all homogeneous sets H in G can be computed in 
linear time, then also w(G). 

Now, if G is a p-connected split-perfect graph, then by Theorem 2, G* is a 
double-split graph or the complement of a double-split graph. In any case, as G* 
is a chordal graph or the complement of a chordal graph, w(G*) can be computed 
in linear time. Further, by Proposition 2, every homogeneous set H of G induces 
a P 4 -free graph, hence co{H) can be computed in linear time. This and the facts 
that the primeval tree of G as well as all maximal homogeneous sets of G can 
be found in linear time show that w(G) can be computed in linear time. The 
problem of minimum coloring can be solved similarly; we omit the details. Note 
that for perfect graphs in general and in particular for split-perfect graphs, the 
two parameters coincide. 

Since split-perfect graphs are closed under complementation, the indepen- 
dence number and the clique cover number of a split-perfect graph can be com- 
puted via the clique number, resp., the chromatic number of the complement of 
the graph considered. Thus, we can state the following result. 
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Theorem 5. The clique number, the chromatic number, the independence num- 
ber, and the clique cover number of a split-perfect graph can be computed in linear 
time. 

The same holds for the weighted version of clique number and chromatic 
number. 
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Abstract. In this paper, we present parallel algorithms for the coarse 
grained multicomputer (CGM) and the bulk synchronous parallel compu- 
ter (BSP) for solving two well known graph problems: (1) determining 
whether a graph G is bipartite, and (2) determining whether a bipartite 
graph G is convex. 

Our algorithms require O(logp) and O(log^p) communication rounds, 
respectively, and linear sequential work per round on a CGM with p 
processors and N/p local memory per processor, A=|G|. The algorithms 
assume that ^ > p’‘ for some fixed e > 0, which is true for all commer- 
cially available multiprocessors. Our results imply BSP algorithms with 
O(logp) and O(log^p) supersteps, respectively, 0{g\og{p)^) communi- 
cation time, and 0(log(p)^) local computation time. 

Our algorithm for determining whether a bipartite graph is convex inclu- 
des a novel, coarse grained parallel, version of the PQ tree data structure 
introduced by Booth and Lueker. Hence, our algorithm also solves, with 
the same time complexity as indicated above, the problem of testing the 
consecutive-ones property for (0, 1) matrices as well as the chordal graph 
recognition problem. These, in turn, have numerous applications in graph 
theory, DNA sequence assembly, database theory, and other areas. 



1 Introduction 

In this paper, we study the problem of detecting bipartite graphs and convex 
bipartite graphs. That is, given an arbitrary graph G, determine whether G is a 
bipartite graph and, given a bipartite graph G, determine whether G is a convex 
bipartite graph. Bipartite and convex bipartite graphs are formally defined as 
follows. 

Definition 1. A graph G = (V,E) is a bipartite graph if V can be partitioned 
into two sets A and B such that Ani? = 0, AU B = V and E C ((A x B) U 
{B X A)). A bipartite graph G is also denoted as G = (A,B,E). 
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Definition 2. A bipartite graph G = (A,B,E) is a convex bipartite graph if 
there exists an ordering (6i, 62, • ‘ ‘ ; ^\b\) of B such that, for all a € A and 1 < 
* < J < 1^1; if (o,bi) € E and (a, bj) G E then {a, bk) G E for all i < k < j. 

These, and closely related, problems has been extensively studied for the 
sequential [1,15] and the shared memory (PRAM) parallel [4,5,11,12,13,14] do- 
main. Unfortunately, theoretical results from PRAM algorithms do not neces- 
sarily match the speedups observed on real parallel machines. In this paper, we 
present parallel algorithms that are more practical in that the assumptions and 
cost model used reflects better the reality of commercially available multiproces- 
sors. More precisely, we will use a version of the BSP model, referred to as the 
coarse grained multicomputer (CGM) model. In contrast to the BSP model, the 
CGM [6, 7, 8, 9] allows only bulk messages in order to minimize message overhead 
costs. A GGM is comprised of a set of p processors P\,...,Pp with 0{N/p) 
local memory per processor and an arbitrary communication network (or sha- 
red memory) . All algorithms consist of alternating local computation and global 
communication rounds. Each communication round consists of routing a single 
/i-relation with h = 0{N/p), i.e. each processor sends 0{N/p) data and receives 
0{N/p) data. We require that all information sent from a given processor to 
another processor in one communication round is packed into one long message, 
thereby minimizing the message overhead. A GGM computation/communication 
round corresponds to a BSP superstep with communication cost g^ (plus the 
above “packing requirement”). Finding an optimal algorithm in the coarse grai- 
ned multicomputer model is equivalent to minimizing the number of communi- 
cation rounds as well as the total local computation time. The GGM model has 
the advantage of producing results which correspond much better to the actual 
performance of implementations on commercially available parallel machines. In 
addition to minimizing communication and computation volume, it also minimi- 
zes important other costs like message overheads and processor synchronization. 

In this paper, we present parallel GGM algorithms for detecting bipartite gra- 
phs and convex bipartite graphs. The algorithms require 0(logp) and 
O(log^p) communication rounds, respectively, and linear sequential work per 
round. They assume that the local memory per processor, N/p, is larger than 
p'^ for some fixed e > 0. This assumption is true for all commercially availa- 
ble multiprocessors. Our results imply BSP algorithms with O(logp) supersteps, 
0{glog{p)^) communication time, and 0(log(p)^) local computation time. 

The algorithm for detecting bipartite graphs is fairly simple and is essentially 
a combination of tools developed in [3] . The larger part of this paper deals with 
the problem of detecting convex bipartite graphs. This is clearly a much harder 
problem. It has been extensively studied in the literature and is closely linked 
to the consecutive ones problem for (0, l)-matrices as well as chordal graph 
recognition [1,4,5,11,12,13,14,15]. 

Our algorithm for determining whether a bipartite graph is convex includes 
a novel, coarse grained parallel, version of the PQ tree data structure introduced 
by Booth and Lueker [1]. Hence, our algorithm also solves, with the same time 
complexity as indicated above, the problem of testing the consecutive-ones pro- 
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perty for (0, l)-matrices as well as the chordal graph recognition problem. These, 
in turn, have numerous applications in graph theory, DNA sequence assembly, 
database theory, and other areas. [1,4,5,11,12,13,14,15] 

2 Detecting Bipartite Graphs 

In this section, we present a simple CGM algorithm for detecting bipartite gra- 
phs. It is a straight-forward combination of tools developed in [3]. 

Algorithm 1 Detection of Bipartite Graphs 

Input: A Graph G = (V,E) with vertex set V and edge set E, |G| = N, stored on a 
CGM with p processors and 0{N/p) memory per processor; N/p > p" for some fixed e > 0. 
V and E are arbitrarily distributed over the memories of the CGM. Output: A Boolean 
indicating whether G is a bipartite graph and, if it is, a partition of V into two disjoint set 
A and B such that E C ((A x B) U {B x A)). 

(1) Compute a spanning forest of G [3]. 

(2) For each tree in the forest, select one arbitrary node as the root. Apply the CGM 
Euler Tour algorithm in [3] to determine the distance between each node and the 
root of its tree. Classify the nodes into two groups: the nodes with an odd numbered 
distance to the root, and the nodes with an even numbered distance to the root. 

(3) Each processor examines the edges stored in its local memory. If any such edge has 
two vertices that belong to the same group, the result for that processor is “failure" ; 
otherwise, the result is “success". 

(4) By applying CGM sort [10] to all “failure" / “success" values, it is determined whether 
there was any processor with a “failure" result. If there was any “failure" , the graph 
G is not bipartite. Otherwise, G is a bipartite graph, and the two groups of vertices 
identified in Step 2 are the sets A and B. 



Theorem 1. Algorithm 1 detects whether G = (V,E), |G| = N, is a bipartite 
graph and, if so, partitions E into sets A and B such that E C ((Ax i?)U(i?x A)) 
in 0{logp) communication rounds and 0(y) local computation per round on a 
CGM with p processors and 0{^) memory per processor, ^ > p^ for some fixed 
e > 0. 

Proof. Omitted due to page restrictions. To be included in the full version of 
this paper. □ 



3 Detecting Convex Bipartite Graphs 

We now turn our attention to the problem of testing whether a given bipartite 
graph is a convex bipartite graph. The sequential solution, presented by Booth 
and Lueker [1], introduced a data structure called PQ-tree. Our coarse grained 
parallel solution will include a novel coarse grained parallel version of the PQ- 
tree. We will first review Booth and Lueker’s PQ-tree definition. 
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Definition 3. A tree T is a PQ-tree if every internal node ofT can be classified 
as either a P-node or a Q-node. A P-node is an internal node that has at least 2 
children, and the children can he permuted arbitrarily. A Q-node is an internal 
node that has at least 3 children, and the children can only he permuted in two 
ways: the original order or the reverse order. The leaves of the PQ-tree are 
elements of a universal set S = {oi, . . . ,a„}, usually called the ground set. 

The order of the ground set in the PQ-tree, from left to right, is called its 
frontier. The frontier of a PQ-tree is clearly a permutation of the ground set. 
Given a PQ-tree T and using only permissible permutations of its internal nodes, 
we can generate a number of permutations of S. We will denote with L(T) the 
set of all these permissible permutations. A PQ-tree T' is equivalent to T if T' 
can be transformed into T using only permissible permutations of the internal 
nodes (if L{T') and L{T) have the same elements). 

Given a set A C S, we say that A G L{T) satisfies A if all elements of A 
appear consecutively in A. The main operation on a PQ-tree T is called reduce: 
given a reduction set A = {Ax, . . . , Ak\ of subsets of S and a PQ-tree T, we want 
obtain a PQ-tree T' , if it exists, such that each permutation in L{T') satisfies 
every A^, 1 < i < k. 

Let m = S^^fi\Ai\ and N = n -\- m. In order to store T and A, we require 
a coarse grained multicomputer with p processors and N/p local memory per 
processor. 

Two particular PQ-trees are the universal and the empty tree: the first one 
has only one internal node (the root of T) and that internal node is a P-node; 
the second one (also called a null PQ-tree) is used to represent an impossible 
reduction, that is when it is impossible to reduce a PQ-tree with respect to a 
given reduction set. 

3.1 Multiple Disjoint Reduce Operations on a PQ-Tree 

In this section, we will present a coarse grained parallel algorithm for the spe- 
cial case of performing multiple disjoint reductions on a PQ-tree. We will then 
use this solution to develop the general algorithm in the subsequent section. 
More precisely, given a PQ-tree T we will first study how to perform the reduce 
operation for a set A = {Ai, . . . , A^} of subsets of the universal set S where 
Al, . . . , Afc are disjoint. We shall refer to our algorithm as Algorithm MDReduce. 
For ease of discussion, each set Ai is assigned a unique color, and we color the 
leaves of the PQ-tree accordingly. Some of the PQ-tree definitions used are from 
[ 1 , 12 ]. 

We start with a pre-processing phase which extends the coloring 6 of the 
leaves to a coloring A of all nodes of the PQ-tree T. For an internal node v 
of T, we say that a color is complete at v if all the leaves with that color are 
descendants of v. We say a color is incomplete at v if some, but not all, of the 
leaves of that color are descendants of v. We say that a color covers v if all 
the leaves below v are of that color, and that v is uncovered if no color covers 
v. Let LCA(c) be the lowest common ancestor of all leaves with color c. Let 
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COLORS{v) denote the set of colors assigned to leaves that are descendents of 
V. Let INC{v) be the set of colors which are incomplete at v. Then INC{v) = 
COLORS{v) - {c: LCA{c) is a descendent of f}. 

Algorithm 2 Pre-Processing the PQ-Tree 
Input: The original PQ-tree T. 

Output: The original PQ-tree T in which each node is assigned a "coloring" A, or, if 
failure occurs, a null tree. 

(1) Apply the coarse grained parallel Lowest Common Ancestor (LCA) algorithm [3]. 

(2) Expand T into a binary tree B. 

(3) Perform tree contraction on B\ see [3]. For each node Vb in B, let Vp be the node in T 
from which Vb is created. Let wi and W 2 be the children of u;,. The operation for the 
tree contraction is INC(vb) = INC{wi) UlNC{w 2 ) - {c: LCA{c) is a descendent 
of Up}. If at any point the size of INC is more than two, stop and return a null tree. 

(4) Let Cv be a new color unique to node v. Each processor, for all its nodes, v, calculates 
A(v) = < ci,C2 > as follows: If two colors are incomplete at v, then ci and C2 are 
these colors. If only one color c is incomplete at v but c does not cover v, then ci = 
c and C 2 = Cv If one color c is incomplete at v and covers v, then ci = C2 = c. If 
no color is incomplete at v, then ci = C2 = c„. 



Lemma 1. On a coarse grained multicomputer with p processors and 0{^) 
storage per processor, Algorithm 2 can he completed in 0{logp) communication 
rounds with 0{^) local computation per round. 

Proof. Omitted due to page restrictions. To be included in the full version of 
this paper. □ 



A node u in a PQ-tree is orientable if it is a Q-node and the two colors in its 
Z\(u) =< Cl, C2 > are different, i.e. ci yf C2- 

For a color c, define hp{c) = { ^ 

For a PQ-tree T where w\ and Wk are the leftmost and rightmost elements, 
respectively, of the frontier frriv), let It = hv{ 6 {wi)) and = hv{ 6 {wk)). If 
Irrlv] =< lT[v],rT[v] > then we use the following notation: < a,b >~< a',b' > 
if {a, 6 } = {o', b'}. 



Algorithm 3 Processing P-Nodes 
Input: The PQ-Tree output from Algorithm 2. 

Output: The original PQ-tree T in which all the P-nodes have been processed, or, if 
failure occurs, a null tree. 

(1) If the input PQ-tree T is a null tree, return T. 

(2) Each processor sets variable FAILURE to FALSE 

(3) Each processor, for each P-node v, reorder the children of v such that for each color 
c all children covered by c are consecutive. 

(4) Each processor, for each P-node v and each color c, if there are at least two children 
covered by c (and at least one child not covered by c) then insert a new P-node Wc 
between these c-covered children and v. 




88 



E. Caceres et al. 



(5) Each processor, for each P-node v, constructs an auxiliary graph G„ whose nodes are 
the children of v and where for each color c there is an edge between children Vi and 
Vj at which c is incomplete if Vi or Vj is covered by c, or there is no child covered by 
c. If any node has more than 2 neighbors, set FAILURE to TRUE to indicate a 
failure condition. 

(6) Perform a multi-broadcast of the variable FAILURE. If any of the broadcast values 
is TRUE, return a null tree. 

(7) Each processor uses list-ranking to identify the connected components of each Gv 
and verifies that each of these connected components is a simple path. If any of these 
components is a cycle, set EAILURE to TRUE to indicate a failure condition. We 
call these paths color chains. 

(8) Perform a multi-broadcast of the variable FAILURE. If any of the broadcast values 
is TRUE, return a null tree. 

(9) Each processor, for each color chain \ containing at least 2 nodes, chooses one of 
the 2 orientations of \ arbitrarily. Reorder the children of v so that the nodes of x 
are consecutive, and insert a new Q-node between these nodes of v. 

(10) Each processor, for each P-node v, let S = {vi : Wi is a child of u, and INC{vi)—i/)} . 
If every child of v is in S, then return. Otherwise, reorder the children of v to make 
S consecutive, insert a new P-node v' between v and the subset S (if |S| > 1), and 
rename w to be a Q-node. 



Lemma 2. On a coarse grained multicomputer with p processors and 0{^) 
storage per processor, Algorithm 3 can he completed in 0{logp) communication 
rounds with 0{^) local computation per round. 

Proof. Omitted due to page restrictions. To be included in the full version of 
this paper. □ 

For each Q-node v, we define an orientation LR{v) which is either A(vi) 
or A{vi)^. Note that if A{vi) =< Ci,C2 > than A{vi)^ =< C2,Ci > For < 
0,6 >~< a',b' > and o yf 6, we define < a,b > swap < a',b' > equals TRUE 
if < 0,6 >=< 6', o' >, FALSE if < o, 6 >=< o', 6' >. For a Q-node v, flip is 
defined as the operation which re-orders all its children in reverse order. 

Algorithm 4 Processing Q-Nodes 
Input: The PQ-tree output from Algorithm 3. 

Output: The original PQ-tree T in which all the Q-nodes have been processed, or, if 
failure occurs, a null tree. 

(1) If the input PQ-tree T is a null tree, return T. 

(2) Each processor sets variable E AI LU RE to EALSE 

(3) Each processor, for each Q-node v and children be i>i, . . . ,Vs, assign to each LR[vi] 
either A(vi) or A{vi)^ such that every color in the sequence LR[vi ], . . . , LR[vs] oc- 
curs consecutively, and such that 6,„(< L[vi], R[va >]) ~ A{v). If this is impossible, 
set FAILURE to TRUE to indicate a failure condition, otherwise, set LR[v] to 
6„(< L[t)i],i?[us] >). 

(4) Perform a multi-broadcast of the variable FAILURE. If any of the broadcast values 
is TRUE, return a null tree. 

(5) Each processor for each node v. if v is orientable, then set OPP[v] to LR[v] swap 
LR[v], otherwise, set OPP[v] to FALSE. 
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(6) Each processor for each node v. set REVlv] to © OPP\u] (Note: 

' ' ■- -* It IS an ancestor of d l j v 

denotes "exclusive-or"). 

(7) For each orientable node v, \f REV[v] is TRUE, then flip v. 

Lemma 3. On a coarse grained multicomputer with p processors and 0{^) 
storage per processor, Algorithm 4 can be completed in 0{logp) communication 
rounds with 0{^) local computation per round. 

Proof. Omitted due to page restrictions. To be included in the full version of 
this paper. □ 



Algorithm 5 Post-Processing the PQ-Tree 

Input: The PQ-tree output from Algorithm 4, with all R-nodes renamed. 

Output: Result of Algorithm MDReduce. 

(1) If T is a null tree, return. 

(2) Each processor temporarily cuts the links of its Q-nodes to their parents. 

(3) Each processor performs pointer jumping for all its nodes that are children of R-nodes 
to determine their lowest Q-node ancestor. 

(4) Each processor restores the links cut in Step 2. 

(5) Each processor eliminates its R-nodes by setting the parents of their children to their 
lowest Q-node ancestors. 



Lemma 4. On a coarse grained multicomputer with p processors and 0(— ) sto- 
rage per processor, Algoritm 5 can he completed using in O(logp) communication 
rounds with 0{^) local computation per round. 

Proof. Omitted due to page restrictions. To be included in the full version of 
this paper. □ 



Theorem 2. On a coarse grained multicomputer with p processors and 0{^) 
storage per processor. Algorithm MDReduce performs a multiple disjoint reduce 
for a PQ-tree T in O(logp) communication rounds with 0{^) local computation 
per round. 

Proof. Omitted due to page restrictions. To be included in the full version of 
this paper. □ 



3.2 Multiple General Reduce Operations on a PQ-Tree 

Using the coarse grained parallel MDreduce algorithm presented in the previous 
section, we will now develop coarse grained parallel algorithm for the general 
M Reduce operation: given a PQ-tree T over the ground set S with n elements, 
perform the reduce operation for an arbitrary reduction sets A = {A \, . . . , A/-} 
Our COM algorithm for the general MReduce operation consists of two pha- 
ses. In the first phase, we execute 31ogp times an algorithm which is a COM 
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implementation of a PRAM algorithm proposed by Klein [12]. We call this ope- 
ration MreducEi(T, {Al, . . Afc}, 0). Our contribution here is the implementa- 
tion of the various shared memory PRAM steps on a distributed memory CGM, 
which is non trivial. After this first phase, we have reduced the problem to one in 
which we are left with a set of smaller PQ-trees over ground sets whose size is at 
most n/p. Hence, each tree can be stored in the local memory of one processor. 
However, we can not guarantee that all the reduction sets of these PQ-trees do 
also fit in the local memory of one processor. In the second phase of our algo- 
rithm, we use a merging strategy to complete the algorithm. We will refer to 
this phase as the Merging Phase. 

First Phase: For a node u of a PQ-tree, leavesriv) denotes the set of pendant 
leaves of v, i.e. leaves of T having v as ancestor. Let IcariA) denote the least 
common ancestor in T of the leaves belonging to A. Suppose that v = IcaxiA) 
has children v\, . . .Vg in order. We say A is contiguous in T if either (1) u is a 
Q-node, and for some consecutive subsequence Vp,...,Vq of the children of v, 
A = [Jp<i<q leavesTivi), or (2) u is a P-node or a leaf, and A = leavesriv). 

Suppose that E is contiguous in T. T\E denotes the subtree consisting of 
IcaxiE) and those children of IcaxiE) whose descendents are in E (it is still a 
PQ-tree whose ground set is E). For a set A, define 

. ( A,r\EAA,r\E^E 

AA,nE = E 



Let denote IcariE). T/E denotes the subtree of T obtained by omitting 
all the proper descendents of IcariE) that are ancestors of elements of E (it is 
still a PQ-tree whose ground set is S' — if U |*_e}). For a set A, define 

. , _ ( Ai — EU {*£;} if Ai A if 

{ Aj — if otherwise 

Algorithm 6 Mreducei(T, (Ai, . . . , Ak}, i)-. 

(1) \f i = 31ogp, return. 

(2) Purge the collection of input sets Ai of empty sets. If no sets remain, return. 

(3) Let n be the size of the ground set of T. If n < 4, carry out the reduction one by one. 
If the size of the input is smaller than the size of the local memory of the processors, 
than solve the problem sequentially using the Booth and Lueker's algorithm. 

(4) Otherwise, let A be the family of (nonempty) sets Ai. Let S consist of the sets A; 
such that |Ai| < n/2. We call such sets ’’small’'. Let C be the remaining, ’’large”, 
sets in A. Find the connected components of the intersection graph of A, find a 
spanning forest of the intersection graph of S, and find the intersection f]C of the 
large sets. 

(5) Proceed according to one of the following cases: 

(a) The intersection graph of A is disconnected. In this case, let Ci,...,Cr be the 
connected components of A. For f = 1, . . . , r, let If i be the union of sets in 
the connected component Ci. Call MDreduce to reduce T with respect to 
the disjoint sets Ei, . . . , Er. Next, for each i = 1, . . . , r in parallel, recursively 
call MREDUCEi(T|Ai,Ci,i -F 1). 
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(b) The union of sets in some connected component of S has cardinality at least 
n/4. In this case, from the small sets making up this large connected compo- 
nent, select a subset whose union has cardinality between n/4 and 3n/4. Let 
E be this union, and call SUBREDUCe(T, E, {Ai, . . . , Ak},i)- 

(c) The cardinality of the intersection of the large sets is at most Snj4:. In this case, 
from the large sets choose a subset whose intersection has cardinality between 
n/4 and 3n/4. Let E be this intersection, and call SUBREDUCe(T, E, {Ai, . . . , 

Ak} , t) ■ 

(d) The other case do not hold. In this case, let E be the intersection of the large 
sets, and call subreduce(T, £1, {Ai, . . . , A^}, i). 

In the full version of this paper, we show how to implement the above on a 
coarse grained multicomputer with p processors and O(^) storage per processor 
in O(logp) communication rounds. The non trivial parts are Step 4, Step 5b, 
the computation of E, T/E, and T\E, as well as the subreduce operation. The 
latter involves another operation called Glue. Due to page restrictions, we can 
not present this part of our result in the extended abstract. Instead, we give one 
example which shows the coarse grained parallel computation of the set E in 
Step 5(b) of Algorithm 6. 

Algorithm 7 Computation of E. 

Input: The set S and the spanning forest of its intersection graph. 

(1) In order to find a connected component C in the spanning forest of <S, such that the 
union of its sets has cardinality at least n/4, order all the components according to 
the labeling given by the coarse grained parallel spanning forest algorithm [3]. 

(2) Sort each component with respect to the values of its elements and mark as "valid" 
only one element per distinct value. 

(3) Sort again with respect to the components' labels. Compute the cardinality of the 
union of the elements of each component (that is the size of each component), with 
a prefix-sum computation, counting only the "valid" elements. (Hence, we do not 
count twice the elements with same values and compute correctly the cardinality of 
the union.) 

(4) If a processor finds a component whose size is > n/4, then it broadcasts the label of 
this component. Otherwise it broadcast a "not-found" message. 

(5) If everybody sent "not-found", go to step 4(c) of Mreduce algorithm. Otherwise, 
among all the labels received in the previous step, choose as C the component with 
the smallest label. 

(6) For each of the sets comprising C, compute the distance in the spanning tree (from 
the root) using the coarse grained parallel Euler-tour technique [3]. 

(7) Sort the sets according to distance, and let Bi, . . . , Bs be the sorted sequence. Sort 

each sets with respect to the values of its elements and mark as "valid" only one 
element per distinct value. Sort the sets again, according to distance, and let i be 
the minimum i such that > n/4. (t can be found with a prefix-sum 

computation on the "valid" elements.) Broadcast i. 

(8) Mark all "valid” elements in Bi, . . . ,Bf as elements of E. 

Second Phase: Consider the tree R of recursive calls in Mreducei. We ob- 
serve that, after I = 31og4/3P levels of R (when the first part of our algorithm 
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stops), the sizes of the ground sets associated with the nodes in R at level I are 
at most n/p. This is due to the fact that the descendants of a node u in R that 
are 3 levels below u are smaller than u by approximately a factor 3/4. More 
precisely, if n{u) denotes the size of the ground set of T{u) (the subtree rooted 
at u) then, for every node w three levels below u, n{u) < 3n(w)/4 + 1. Hence, 
each PQ-tree obtained at the end of the first phase fits completely into the local 
memory of one processor. 

Unfortunately, the same argument does not hold for the reduction sets. Recall 
that TO = Let u be an internal node of R, . . . , its reduction 

sets, and to„ = Since the sizes of the reduction sets of the children of 

u depend strictly on the and on how they intersect with the set E computed 
for u, it is possible that the are split in an unbalanced way. That is, we 
can have = 0 (to„) and | = 0(1) (or vice versa). If this 

continues up to level 31ogp of R, it is possible that for a recursive call associated 
with a node v at level I, > m/p. 

Therefore, while the ground set of T(y), and hence T(v), can fit in one pro- 
cessor, the reduction sets could possibly not. Thus, at this point of the compu- 
tation, we can not simply use the sequential algorithm of Booth and Lueker [1] 
for completing the reduction. 

Our idea for solving this problem is the following. Let us consider a node v 
at level I in R that has to„ > m/p. Since, at any level of recursion, the sum of 
the sizes of all reduction sets is at most 2m, we can create copies of T{v), 
with We observe that 

w . P ^ P O -O 

\_ / J _ — ^P- 

m/p m/p m m 

Hence, we require at most two copies per processor. The reduction problem of 
each node v at level I of R will be solved by the processors that have copies 
of T{v). The next step is the distribution of the reduction sets associated to v 
among these processors. Each of these processors can solve locally the 
problem of reducing T{v) with respect to the reduction sets that it has stored, 
using Booth and Lueker’s algorithm [1]. For each processor, let T'{v) refer to 
this reduced tree. Now, we need to merge these trees, T'{v). More precisely, 
we need to compute a PQ-tree T{v) such that L{T{v)) = L{T{v)), where T{v) is 
the PQ-tree that we would have obtained by reducing T(v) directly with respect 
its reduction sets. For the construction of T(w), we merge the T'{v) trees in a 
binary tree fashion. 

Algorithm 8 Merging Phase 

Input: h PQ-trees T{i), with |T(i)| < n/p and Ei\T{i)\ < n, and their reduction sets. 
Output: The T{i) reduced with respect their reduction sets. 

(1) Let mi be the sum of the sizes of the reduction sets of Ti. Make at = copies 

of each T{i). Distribute the reduction sets of each Ti between the processors that 
have the copies of T(i). 

(2) Each processor executes the sequentiai aigorithm [1] for its PQ-trees with the reduc- 
tion sets that it has stored. Let T' (i) refer to the trees obtained. 
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(3) The ai processors associated with each T{i) merge the T'{v) trees in a binary tree 
fashion. More details are outlined below. 

The following Theorem 3 shows that the merge operation in Step 3 of Al- 
gorithm 8 reduces to a tree intersection operation. We have designed a CGM 
algorithm for tree intersection which implements Step 3 of Algorithm 8. Due 
to page restrictions, we can not include a description of our tree intersection 
algorithm in this extended abstract. It will be included in the full version of this 
paper. 

Theorem 3. Let T he a PQ-tree over the ground set S and let T' he a eopy of 
T. LetT* andT'* he the result of the reduction of T with respect to {Ai, . . . , Ar} 
and ofT' with respect to {i?i, . . . , Bt}, respectively. Let T he the PQ-tree obtained 
hy reducing T with respect to {Ai, . . . , Ar, Bi, . . . , Bt}. Then, 

A G L(T) A G L{T*) n L(T'*). 

Proof. L{T) is the intersection of the sets of all orderings that satisfy A\, . . . , Ar, 
Bi, . . . ,Bt, and L{T). L{T) is always the same, independently of the order in 
which we reduce T. If A G L{T), then A must belong to the intersection between 
the set of all orderings that satisfy A\, . . . ,Ar and L(T) and it must also belong to 
the intersection between the set of all orderings that satisfy B\, ... ,Bt and L(T), 
that is A G L{T*), A G L{T'*) and A G L{T). Hence A belongs to L{T*)C\L{T' *). 
The reverse can be shown analogously. □ 

In summary, we obtain 

Theorem 4. On a coarse grained multicomputer with p processors and 0{^) 
storage per processor, Algorithm MReduce performs a reduce operation for a 
PQ-tree T in O(log^p) communication rounds with 0{^) local computation per 
round. 



3.3 Convex Bipartite Graphs 

Recall the definition of convex bipartite graphs (Definition 2) . Given a bipartite 
graph G = (A,B,E) with A = {oi, 02, • • • , Ofc} and B = {bi,b 2 , ■ ■ ■ ,hn}. Let 
A = {Ai, . . . , Ak} where At = {b € B : (ai,b) € E}, and let T be a PQ-tree over 
the ground set B consisting of a root with children &i, 62, • • • , 6n- The problem of 
determining whether G is convex and, if this is the case, computing the correct 
ordering of the elements in B is equivalent to the MReduce operation on T with 
respect to A. 

Theorem 5. On a coarse grained multicomputer with p processors and 0{^) 
storage per processor, the problem of determining whether G is convex (and 
computing the correct ordering of the elements in B) can he solved in O(log^p) 
communication rounds with 0{^) local computation per round. 
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Abstract. In a previous work, the authors introduced the class of graphs 
with bounded induced distance of order k, (BID(fc) for short) to model 
non-reliable interconnection networks. A network modeled as a graph in 
BID(fc) can be characterized as follows: if some nodes have failed, as long 
as two nodes remain connected, the distance between these nodes in the 
faulty graph is at most k times the distance in the non-faulty graph. The 
smallest k such that G G BID(fc) is called stretch number of G. In this 
paper we give new characterization, algorithmic, and existence results 
about graphs with small stretch number. 



1 Introduction 

The main function of a network is to provide connectivity between the sites. 
In many cases it is crucial that (properties about) connectivity is preserved 
even in the case of (multiple) faults in sites. Accordingly, a major concern in 
network design is fault-tolerance and reliability. That means in particular that 
the network to be constructed shall remain reliable even in the case of site faults. 

According to the actual applications and requirements, the term ‘reliability’ 
may stand for different features. In this work, it concerns bounded distances, 
that is our goal is to investigate about networks in which distances between 
sites remain small even in the case of faulty sites. As the underlying model, we 
use unweighted graphs, and measure the distance in a network in which node 
faults have occurred by a shortest path in the subnetwork that is induced by 
the non-faulty components. Using this model, in [7] we have introduced the class 
BID(/c) of graphs with hounded induced distance of order k. A network modeled 
as a graph in BID(fc) can be characterized as follows: if some nodes have failed, 
as long as two nodes remain connected, the distance between these nodes in the 
faulty graph is at most k times the distance in the non-faulty graph. 

Some characterization, complexity, and structural results about BID (A:) are 
given in [7]. In particular, the concept of stretch number has been introduced: 
the stretch number s{G) of a given graph G is the smallest rational number 
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the DFG, was visiting the Department of Computer Science, University of Rostock, 
Germany. 
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k such that G belongs to BID(A:). Given the relevance of graphs in BID(A:) in 
the area of communication networks, our purpose is to provide characterization, 
algorithmic, and existence results about graphs having small stretch number. 

Results. We first investigate graphs having stretch number at most 2. In this 
context we show that: (t) there is no graph G with stretch number s(G) such 
that 2 — j < s(G) < 2 — for each integer i> 1 (this fact was conjectured 
in [7]); {ii) there exists a graph G such that s(G) = 2 — i, for each integer i>l. 
These results give a partial solution to the following more general problem: 
Given a rational number k, is k an admissible stretch number, i.e., is there a 
graph G such that s(G) = kl We complete the solution to this problem by 
showing that every rational number fc > 2 is an admissible stretch number. 
Finally, we give a characterization result in term of forbidden subgraphs for the 
class BID (2 — y), for each integer i > 1. This characterization result allows us to 
design a polynomial time algorithm to solve the recognition problem for the class 
BID (2 — y), for each t > I (if fc is not fixed, this problem is Go-NP-complete for 
the class BID(fc) [7]). We conclude the paper by showing that such an algorithmic 
approach cannot be used for class BID(fc), for each integer k > 2. 

Related works. In literature there are several papers devoted to fault-tolerant 
network design, mainly starting from a given desired topology and introducing 
fault-tolerance to it (e.g., see [4,15,19]). Other works follow our approach. 

In [14], a study about our concepts is performed: they give characterizations 
for graphs in which no delay occurs in the case that a single node fails. These 
graphs are called self-repairing. In [8], authors introduce and characterize new 
classes of graphs that guarantee constant stretch factors k even when a multiple 
number of edges have failed. In a first step, they do not limit the number of 
edge faults at all, allowing for unlimited edge faults. Secondly, they examine the 
case where the number of edge faults is bounded by a value £. The correspon- 
ding graphs are called /c-self-spanners and (fc, t')-self-spanners, respectively. In 
both cases, the names are motivated by strong relationships to the concept of 
k-spanners [21]. Related works are also those concerning distance-hereditary gra- 
phs [18]. In fact, distance-hereditary graphs correspond to the graphs in BID(l), 
and graphs with bounded induced distance can be also viewed as a their pa- 
rametric extension (in fact, BID(fc) graphs are mentioned in the survey [2] as 
fc-distance-hereditary graphs). Distance-hereditary graphs have been investiga- 
ted to design interconnection network topologies [6,11,13], and several papers 
have been devoted to them (e.g., see [1,3,5,10,12,16,20,23]). 

The remainder of this extended abstract is organized as follows. Notations and 
basic concepts used in this work are given in Section 2. In Section 3 we recall 
definitions and results from [7]. Section 4 shows the new characterization results, 
and in Section 5 we answer the question about admissible stretch numbers. In 
Section 6 we give the complexity result for the recognition problem for the class 
BID(2 — y), for every integer i > 1, and in Section 7 we give some final remarks. 
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Due to space limitations, some proofs and technical details are omitted and will 
be provided in the full paper. 



2 Notation 



In this work we consider finite, simple, loopless, undirected and unweighted gra- 
phs G = {V, E) with node set V and edge set E. We use standard terminologies 
from [2,17], some of which are briefly reviewed here. 

A subgraph of G is a graph having all its nodes and edges in G. Given a 
subset S of V, the induced subgraph (S) of G is the maximal subgraph of G with 
node set S. |G| denotes the cardinality of V. If a: is a node of G, by Nc{x) we 
denote the neighbors of x in G, that is, the set of nodes in G that are adjacent 
to X, and by Ng[x] we denote the closed neighbors of x, that is Ng{x) U {x}. 
G — S' is the subgraph of G induced by F \ S. 

A sequence of pairwise distinct nodes (xq, . . . , x„) is a path in G if (xj, Xj+i) € 
E for 0 < i < n, and is an induced path if ({xq, . . . , x„}) has n edges. A graph 
G is connected if for each pair of nodes x and y oi G there is a path from x to y 
in G. 

A cycle Gn in G is a path (xq,... ,x„_i) where also (xo,x„_i) G E. Two 
nodes Xi and Xj are consecutive in G„ if j = {i+ 1) mod n or i = {j + 1) mod n. 
A chord of a cycle is an edge joining two non-consecutive nodes in the cycle. 
Eln denotes an hole, i.e., a cycle with n nodes and without chords. The chord 
distance of a cycle G„ is denoted by cd(G„), and it is defined as the minimum 
number of consecutive nodes in G„ such that every chord of G„ is incident to 
some of such nodes. We assume cd(i?„) = 0. 

The length of a shortest path between two nodes x and j/ in a graph G 
is called distance and is denoted by dG{x,y). Moreover, the length of a longest 
induced path between them is denoted by Dg{x, y). We use the symbols Pa{x, y) 
and pa{x,y) to denote a longest and a shortest induced path between x and y, 
respectively. Sometimes, when no ambiguity occurs, we use Pg{x, y) and pg{x, y) 
to denote the sets of nodes belonging to the corresponding paths. /g(x, y) denotes 
the set containing all the nodes (except x and y) that belong to a shortest path 
from X to y. 

If X and y are two nodes of G such that dG{x, y) > 2, then {x, y} is a cycle-pair 
if there exist a path pg{x, y) and a path Pg{x, y) such that pg{x, y)C\PG{x, y) = 
{x,y}. In other words, if {x,y} is a cycle-pair, then the set pG{x,y) U PG{x,y) 
induces a cycle in G. 

Let Gi, G 2 be graphs having node sets ViU{mi}, V 2 U{m 2 } and edge sets E\, 
E 2 , respectively, where {Vi,V 2 } is a partition of V and mi, m 2 ^ V. The split 
composition [9] of Gi and G 2 with respect to mi and m 2 is the graph G = Gi * G 2 
having node set V and edge set E = U A 2 U {(x, y) \ x £ N{mi),y £ JV(m 2 )}, 
where E^ = {(x, y) £ Ei j x, y £ Vi} for i = 1,2. 
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3 Basic Definitions and Previous Results 

In this section we recall from [7] some definitions and results useful in the rema- 
inder of the paper. 

Definition 1. [7] Let k he a real number. A graph G = (V,E) is a bounded 
induced distance graph of order k if for each connected induced subgraph G' of 
G: 



dG'{x,y) < k ■ da{x,y), for each x,y gG'. 

The class of all the bounded induced distance graphs of order k is denoted by 
BID(/c). 

From the definition it follows that every class BID(fc) is hereditary, i.e., if G G 
BID(/c), then G' G BID(fc) for every induced subgraph G' of G. 

Definition 2. [7] Let G be a graph, and {x, y} be a pair of connected nodes in 
G. Then: 

1. the stretch number sg{x, y) of the pair {x, y} is given by sg{x, y) = / 

2. the stretch number s(G) of G is the maximum stretch number over all pos- 
sible pairs of connected nodes, that is, s(G) = max^^, j,} SG{x,y); 

3. S{G) is the set of all the pairs of nodes inducing the stretch number ofG, 
that is, 5(G) = {{x,y} \ SG(x,y) = s(G)}. 

The stretch number of a graph determines the minimum class which a given 
graph G belongs to. In fact, s(G) = min{t : G G BID(t)}. As a consequence, 
G G BID(A:) if and only if s(G) < k. 

Lemma 1. [7] Let G G BID(fc), and s{G) > 1. Then, there exists a cycle-pair 
{x,y} that belongs to 5(G). 



Theorem 1. [7] Let G be a graph and k>l a real number. Then, G G BID(fc) 



if and only if cd(G„) > 



fc+i 



— 2 for each cycle C„, n > 2k -\- 2, of G. 



4 New Characterization Results 

Graphs in BID(l) have been extensively studied and different characterizations 
have been provided. One of these characterization is based on forbidden induced 
subgraphs [1], and in [7] this result has been extended to the class BID(|). In 
this section we further extend this characterization to the class BID(2 — i), for 
every integer i > 2. 

Lemma 2. Let G be a graph with 1 < s(G) < 2, and let {x,y} G 5(G) be a 
cycle-pair. Lf G is the cycle induced by pG{x,y) \J PG{x,y), then every internal 
node of pG{x,y) is incident to a chord of C . 
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Proof. Omitted. □ 

Theorem 2. Given a graph G and an integer i > 2, then G € BID(2 — i) if 
and only if the following graphs are not induced subgraphs of G: 

1. Hn, for each n > 6; 

2. cycles Cq with cd{Ce) = 1; 

3. cycles Cr with cd{Cr) = 1; 

4- cycles Cg with cd{Cs) = 1; 

5. cycles Csi +2 with cd (< 73 ^+ 2 ) = *• 

Proof. Only if part. Holes Pin, n> 6, have stretch number at least 2. Cycles with 
6, 7, or 8 nodes and chord distance 1 have stretch number equal to 2, 5/2, and 
3, respectively. Cycles C 3 i +2 with chord distance equal to i have stretch number 
at least = 2 — . Since the considered cycles have stretch number greater 

than 2 — i, then they are forbidden induced subgraphs for every graph belonging 
to BID(2 - \). 

If part. Given an arbitrary integer i > 2, we prove that every graph G ^ 
BID (2 — y) contains one of the forbidden subgraphs or a proper induced sub- 
graph G' ^ BID(2 — \). In the latter case, we can recursively apply to G' the 
following proof. 

If G ^ BID(2 — j) then, by Theorem 1, G contains a cycle G„, n > 6, 
as induced subgraph such that 0 < cd(G„) < — 2. This means that 

a cycle-pair {x,y} € S{G) generates G„. In particular, we can assume that 
G„ is induced by the nodes of the two internal node-disjoint paths Pc{x,y) = 
{x,ui,U 2 , ... ,Up,y) and pa{x,y) = {x,vi,V2, . ■ . ,Vq,y), such that p + q + 2 = n 
and cd{Cn) = q. 

If (j = 0 then we obtain the holes Pin, n > 6. If q = gt/Zf ~ 2 and n = 
6, 7, 8, 3i -I- 2, then we obtain the other forbidden subgraphs. 

Now, we show that if n > 9, n yf 3t -I- 2, and q fulfills 1 < <7 < — 2, 

then G„ contains one of the given forbidden subgraphs or an induced subgraph 
G' such that G' ^ BID(2 — i). 

By Lemma 2, every node Vk, I < k < q, must be incident to a chord of 
G„, otherwise G„ has a stretch number greater or equal to 2 and hence it is 
itself a forbidden subgraph of G. As a consequence, we can denote by rj the 
largest index f such that Vj and Uj' are connected by a chord of G„, i.e. rj = 
max{j' I (vj,Ujr) is a chord of G„}. Informally, rj gives the rightmost chord 
connecting vj to some vertex of PG{x,y). 

Notice that, if ri >3 then the subgraph of G„ induced by the nodes 
Vi,x,ui, . . . , Mri is forbidden, since it is a cycle with at least 6 nodes and chord 
distance at most 1. Hence, in the remainder of this proof we assume that ri < 3. 

Let us now analyze two distinguished cases for G„, according whether the 
chord distance q of Cn either (i) fulfills 1 < g < — 2, or {ii) is equal to 

i-n 

3i-l 
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3i-l 



(i) We consider C„ with n > 9 and chord distance q such that 1 < g < 

2. If denotes the subgraph induced by the nodes of Cn except the nodes 
x,Ui, . . . , Uri-i) then C„/ is a cycle with n' > n — 3 nodes and chord distance at 



most g— 1. To prove that C„/ is forbidden, we have to show that 

-2>g-l, 



3i-l 



-2 > q-1: 



i-n' 


- 2 > 


i-n—3i 


32-1 




32-1 



i-n—3i 

3z-l 



2> q-2, 



i-n—3i 

3i-l 




2>q, 



i-n-^4i—2 

3i-l 



2> q. 



The last inequality holds because 4t — 2 > 0 for each integer i > 1, and 
2> q. 



i-n 

3i-l 



{ii) We consider with n >9 and chord distance q such that q = | | ~ 2. 

In this case q is given whenever a fixed value for n is chosen. In general, since 
n > 9, it follows that q> 2. 

Let us analyze again the cycle Cn' - Recalling that n' > n — 3 and cd{Cn') < 
q — 1, then 



i-n' 

3i-l 



-2 > 



i-n—3i 

3i-l 



2>q-l 



is equivalent to 



i-n— 1 
3f-l 



2>q. 



In the following we show that, for every n such that 9 < n < 6i, either this 
relation holds or n is equal to 3i + 2. This means that the cycle Cn' is forbidden 
for each cycle C„, 9 < n < 6i. 



Since 



i-n 

32-1 



— 2 = q holds by hypothesis, we have to study when 



i-n— 1 
32-1 



> 



I I • This relation does not hold if and only if there exists an integer m such 
that < m < that is Then, since this equality is equivalent 

to n = 3m — TO can be equal to ^ ■ i + I only, for each integer £ > 0. As 
a consequence, n = 3m — = 3{£ ■ i + 1) — £, £ > 0. For £ = 0 we obtain 

n = 3 (but we are considering n > 9), for £ = 1 and £ = 2 the value of n is 3t + 2 
and n = 6i + 1, respectively. The cycle with 3i + 2 nodes is one of the forbidden 
cycles in the statement of the theorem. As a conclusion, the induced cycle Cn' 
shows that C„ contains a forbidden induced subgraph when 9 < n < 6z. 
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It remains to be considered the case when n > 6i + 1. In this case q = 



i-n 

3z-l 



2 implies q > 2i, and hence we can compute the value Since 



Vi, Vi-i , ... ,vi,x,ui, . . . , Un induce a cycle with chord distance i, then it has at 
most 3t+l nodes otherwise it is forbidden. In other words, Vi < 2i. The subgraph 
Cn" induced by the nodes of C„ except the nodes Vi-i, ... ,vi,x,ui, . . . , u^-i 
is a cycle with n" > n — 3i + 1 nodes and chord distance at most q — i. To prove 



that Cn" is forbidden, let us show that 



i-n 

32-1 



2 > q — i. The inequality 



i-n" 


- 2 > 


2 - (n— 32+1) 


32-1 




32-1 



2 > q — i 



is equivalent to 



The last relation holds by hypothesis, and this concludes the proof 



□ 



5 Admissible Stretch Numbers 

In [7], it was conjectured that there exists no graph G such that 2 — i < s{G) < 
2 — for each integer t > 1. In this section we show that the conjecture is 
true. Moreover, we extend the result by showing that it is possible to answer to 
the following general question: Given a rational number t > 1, is there a graph G 
such that s(G) = tl . In other words, we can state when a given positive rational 
number is an admissible stretch number. 

Definition 3. A positive rational number t is called admissible stretch number 
if there exists a graph G such that s{G) = t. 

In the remainder of this section we first show that the conjecture recalled 
above is true, and then we show that each positive rational number greater or 
equal than 2 is an admissible stretch number. 

Lemma 3. If p and q are two positive integers such that 2— i<|<2 — 
for some integer i > I, then q > i. 

Proof. Omitted. □ 



Theorem 3. If t is a rational number such that 2, — \ <t <2— for some 
integer z > 1, then t is not an admissible stretch number. 

Proof We have to show that there exists no graph G such that 2 — j < s(G) < 
2 — for each integer z > 1. 

By contradiction, let us assume that there exist an integer z > 1 and a 
graph G such that 2 — i < s(G) < 2 — By Lemma 1 there exists a cycle- 
pair {x,y} G S{G). If we assume that PG{x,y) = {x,Ui,U2, ■ . ■ ,Up-i,y) and 
PG{x,y) = {x,vi,V 2 ,... ,Vq-i,y), then PG{x,y) A PG{x,y) induces a cycle G, 
and s(G) = G By Lemma 3, the relation q> i holds; then, the node Vi exists 
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in the path pG{x,y). By Lemma 2, the node Vi is incident to a chord of C, and 
hence we can define the integer r, l<r<g— 1, such that 

r = max{j | (vi,Uj) is a chord of C}. 

Let us now denote by Cl the cycle induced by the nodes Vi,Vi-i, . . . ,Vi,x,Ui, 
U 2 , • ■ • , Ur^ and by Cr the cycle induced by the nodes Vi, Vi+i, . . . , Vq-i,y, Up-\, 
Up-2, • ■ • , Ur- In other words, the chord {vi, Ur) divides C into the left cycle Cl, 
and the right cycle Cr. 

First of all, let us compute the stretch number of the cycle Cr. Since 
PG{x,y) = {x,vi,V2, ■ . ■ ,Vg-i,y) thenpca{vi,v) = {vi,Vi+i, . . . ,Vq-i,y). Moreo- 
ver, since the path {vi,Ur,Ur+i, ■ ■ ■ ,Up-\,y) is induced in C, then its length 
implies Dc^{vi, y) > p — r + 1. Then 



s{Cr) > scn^v^y) > - — 

q i 

Since Cr is an induced subgraph of G then 

p — r + 1 ^ p 
q-i ~ q 



This inequality is equivalent to 



p ^ r - 1 
q ~ i 



From the relations 



„ 1 p r — 1 

2 - - < - < 

i q i 



we obtain that r > 2i, that is r > 2z -|- 1. 

Let us now compute the stretch number of the cycle Cl when r > 2z -|- 1. In 
this case, pc^ (x, Ur) = {x,V\,V 2 , ■ ■ ■ ,Vi,Ur) and Pc^ {x,Ur) = {x,U\,U 2 , ■ ■ ■ ,Ur). 
Then 



s(C'l) > SCL(x,Ur) 



r ^ 2z -I- 1 
z -I- 1 “ z -I- I 



> 2 - 



I 

z -I- 1 ' 



The obtained relation implies that s{Cl) > s{G). This is a contradiction since 
Cl is an induced subgraph of G. □ 



In order to show that each rational number equal or greater than 2 is admissi- 
ble as stretch number, let us consider the graph G{ni,ri 2 , ■ ■ . ,rit) obtained by 
composing t holes j • • • ? by split composition, where rzj > 5 for 

1 < z < t. The holes correspond to the following chordless cycles (as an example, 
see Figure 1, where t = 5) : 

- Hni = {h,xo,xi,m[,ri,...); 

- Hrn = {li,mi,Xi,m'^,ri, . . .), for each z such that 1 < z < t; 

- H„^ = {It, mt, xt, xt+i,rt, ...). 
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The holes are composed by means of the split composition as follows: 

G(ni,ri 2 , ... ,rit) = * • • • * 

and the marked nodes between and are m' and rrii+i, 1 < i < t, 

respectively. 

In the following, we denote by Vi {Vt, resp.) the set containing all the nodes 
of the hole resp.) but Xq, Xi, and m'l (rrit, Xt, and Xt+i, resp.); we 

denote by Vi the set containing all the nodes of the hole i/„. but rrii, Xi, and 
tUi) 1 < i <t. Finally, we denote by X the set {xq, xi, . . . , Xt+i}. 




Xl X2 X3 X4 Xs 




Fig. 1. The graph G{n\,ri 2 , ns, n 4 , ns) obtained by the split eomposition of 5 holes. The 
i-th hole has ni > 5 nodes. 



Lemma 4. Given the graph G = G(ni, n 2 , . . . , n*), the following facts hold: 

1 . SG(xo,Xt+i) = ; 

2. if i < j then pa{xi,Xj) U PcixijXj) induces a subgraph isomorphic to 

Gijii , , . . . , nj ) , 

3. ifnt > then SG(xo,Xt+i) > scixo.Xt); 

4 . there exists a pair {u,v} € S(G) such that u € X and v G X; 

5. if Ui = n for some fixed integer n, 1 < i < t, then s(G) = SG(xo,Xt+i) = 

nt— 3t+l 
i+1 ■ 

Proof. Omitted. □ 

Notice that the stretch number of nodes xq and Xt+i in G(ni,n- 2 , . . . ,nt) does 
not depend on how many nodes are in each hole; it depends only on the total 
number of nodes in G(ni, U 2 , . . . , nt) and on the number t of used holes. 

Theorem 4. If t is a rationale number such that t>2, then t is an admissible 
stretch number. 
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Sketch of the Proof. Let us suppose that t = p/q for two positive integers p and 
q.liq=l then G = i? 2 p+ 2 , if g = 2 then G = iLp+ 2 - When q > 3 we show that 
the graph G is equal to G{rii,n 2 , ■ • ■ , riq-i) for suitable integers ni, ri 2 , . . . , Uq-i. 



Let 6 = 3 + 



p-i 

9-1 



holes , . 



and r = (p — 1) mod (g — 1). Let us choose the sizes of the 
Hrig-i according to the following strategy: r holes contain 
exactly 6+1 nodes, while the remaining q — I — r contain exactly 6 nodes. By 

Fact 1 of Lemma 4 it follows that SG(a: 0 )a^g) = = = To prove 

that S{G) = p/q, by Fact 4 of Lemma 4, we have to prove that so(xi,Xj) < 
p/q, 1 < i,j < q — 2. This property holds only if we are able to determine a 
deterministic method to decide whether the hole Hn., 1 < i < q — 1, contains 
either 6 or 6 + 1 nodes. In the full paper we show that such a deterministic 
method exists. □ 



Corollary 1. For each integer i > 1, 2 — i is an admissible stretch number. 

Proof. From Fact 5 of Lemma 4, it follows that G = G{ni,U 2 , ■ . ■ ,rii-i) such 
that Uj = 5 for each 1 < j < t — 1, has stretch equal to 2 — j. □ 

The results provided by Corollary 1, Theorem 3, and Theorem 4 can be summa- 
rized in the following two corollaries. 

Corollary 2. Let t be an admissible stretch number. Then, either t > 2 or 
t = 2 — f for some integer i > 1. 

Corollary 3. For every admissible stretch number t, split composition can be 
used to generate a graph G with s{G) = t. 

Notice that, by Theorem 4, we can also use every irrational number greater 
than 2 to define graph classes containing graphs with bounded induced distance. 
For instance, BID(7 t) yf BID(fc) for every rational number k. 



6 Recognition Problem 

The recognition problem for BID(l) can be solved in linear time [1,16]. In [7], this 
problem has been shown Co-NP-complete for the generic case (i.e., when k is not 
fixed), and the following question has been posed: What is the largest constant k 
such that the recognition problem for BID (A:) can be solved in polynomial time? 

In this section we show that Theorem 2 can be used to devise a polynomial 
algorithm to solve the recognition problem for the class BID(fc), for every k <2. 

Lemma 5. There exists a polynomial time algorithm to test whether a given 
graph G contains, as induced subgraph, a cycle C„ with n > 6 and cd{Gn) < 1. 

Proof. Omitted. □ 

Theorem 5. For any fixed integer i > 1, the recognition problem for the class 
BID (2 — 1) can be solved in polynomial time. 
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Proof. For i = 1 the problem can be solved in linear time [1,16]- By Theorem 2, 
a brute-force, rather naive algorithm for solving the recognition problem for the 
class BID(2 — i), i > 1, is: test if G contains, as induced subgraph, (1) a cycle 
Cn with n > 6 and cd(C„) < 1, or (2) a cycle C 3 i +2 with chord distance equal 
to i. To perform Test 1 above, we can use the algorithm of Lemma 5, and to 
perform Test 2 we can check whether any subset of 3t -I- 2 nodes of G forms 
a cycle with chord distance equal to i. The latter test can be implemented in 
polynomial time since the number of subsets of nodes with 3i + 2 elements is 
bounded by □ 

7 Conclusions 

In this paper we provide new results about graph classes that represent a para- 
metric extension of the class of distance-hereditary graphs. In any graph G belon- 
ging to the generic new class BID(fc), the distance between every two connected 
nodes in every induced subgraph of G is at most k times their distance in G. 

The recognition problem for BID (2 — y) can be solved in polynomial time 
(Theorem 5), and the corresponding algorithm is based on Theorem 2. Can the 
same approach be used in order to solve the same problem for class BID(fc), 
fc > 2? In other words, if fc > 2 is an integer, is it possible to characterize 
BID(A:) by listing all its forbidden induced subgraphs? For instance, holes iL„ 
with n > 2k + 3, and a finite number of cycles having different chord distance. 
Unfortunately, the following theorem states that it is not possible. 

Theorem 6. For each integers k >2 and i >2, there exists a minimal forbidden 
induced cycle for the class BID(fc) with chord distance equal to i. 

Proof. Omitted. □ 

Many problems are left open. For instance, what is the largest constant 
k such that the recognition problem for BID(/c) can be solved in polynomial 
time? Moreover, several algorithmic problems are solvable in polynomial time 
for BID(l). Can some of these results be extended to BID(/c), fc > 1? 
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Abstract. The dispersion problem in a graph reqnires to find a snbset 
of vertices of prescribed size, so as to maximize the minimum distance 
between the chosen vertices. We propose efficient algorithms solving the 
dispersion problem in interval graphs, circular-arc graphs, and trapezoid 
graphs. Graphs are supposed to be represented geometrically, rather than 
by their edge sets. 



1 Introduction 

Given a graph and a number q, the dispersion problem requires to find a subset of 
q vertices so as to maximize the minimum distance between the chosen vertices. 
The problem is motivated e.g. by the location of undesirable facilities. It can 
be naturally generalized to weighted graphs, i.e. with arbitrary positive vertex 
weights and edge lengths. (The subset is demanded to have total weight at least 
q.) The problem is NP-hard for general graphs, since it includes the maximum 
independent set problem, and it can be approximated in polynomial time within 
ratio 2, see e.g. [14]. For g = 2, it is nothing else than the problem of computing 
the diameter. 

At least in the unweighted case, there is an obvious relationship between 
dispersion, /c-independent sets and fc-th powers of graphs. If maximum inde- 
pendent sets can be found in polynomial time in the class of powers of graphs 
from a class Q then the dispersion problem in Q can be solved in polynomial 
time, too. Fortunately, a number of well-known graph classes are closed under 
graph powers. However, the straightforward use of this observation yields I7(n^) 
time dispersion algorithms in such classes. In order to obtain more efficient al- 
gorithms, we have to avoid explicit insertion of edges to get the fc-th powers of 
the given graph where we look for maximum independent sets. 

To the best of our knowledge, only few subquadratic dipersion algorithms 
have been provided recently: The dispersion problem can be solved in 0{n) 
time for unweighted trees [1], in O(nlogn) time for weighted paths [14], and 
in O(nlog^n) time for weighted trees [2]. In the present note we give likewise 
efficient algorithms for interval graphs, circular-arc graphs, and trapezoid graphs. 
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2 Preliminaries 

We introduce our notion and report some very well-known facts. 

Let G = (V,E) be an undirected graph with n vertices and m edges. We 
will always assume that G is connected, without explicit notice; this is no loss 
of generality. The distance d{u, v) of two vertices u, v is the length, i.e. number 
of edges, of a shortest path connecting u and v. The diameter diam{G) is the 
maximum distance among vertices of G. A distance query in a graph takes a 
pair u,v as its input and outputs d{u,v). The edges of the fc-th power G* of G 
are all pairs uv such that d{u,v) < k in G. We have G“^ = (G“)^. 

A g-dispersion set in G is a subset S C V with [S'! = q such that md{S) = 
min{(i(u, u)| m, u G S,u ^ v} is maximized, among all subsets of V with q verti- 
ces. The dispersion problem requires to find a g-dispersion set, given G and q. A 
set S is called fc-independent if md{S) > k. A 1-independent set is simply called 
independent. Note that fc-independent sets in G are exactly the independent sets 
in G''. 

An interval family / is a set of intervals [li,ri], i = 1, . . . , n on the real line. 
The interval graph G associated with / has the intervals of / as its vertices, 
and two vertices are adjacent iff the corresponding intervals have a nonempty 
intersection. G is called the intersection graph of I. Similarly, as circular-arc 
graph is the intersection graph of a family of arcs on a circle. A trapezoid graph 
is the intersection graph of a family of trapezoids two opposite sides of which lie 
on a pair of parallel straight lines. 

In an interval family /, we may w.l.o.g. assume that all h,ri are mutually 
distinct. The vertex represented by [li,ri] is named Vi. We sometimes informally 
identify vertices and intervals. We say that I is sorted (with respect to right 
interval endpoints) if ri < . . . < r„. Instead of u = Vi, v = Vj, i < j we simply 
write u < V. 

One easily sees that a sorted interval family has the following properties: 

(1) If u < u < w and uv, uw are edges then vw is also an edge. 

(2) If M < u < w and uw is an edge then at least one of uv, vw is also an 
edge. 

An ordering of the vertices of a graph satisfying (1) is called a perfect elimi- 
nation ordering (PEO). A graph admits a PEO if and only if it is chordal, i.e. 
contains no chordless cycles of more than three vertices. All interval graphs are 
chordal, but not vice versa. 

A maximum independent set in a PEO can be found in 0{n + m) time: Scan 
the PEO from left to right. Starting with 5 = 0, add a vertex to S if it is not 
adjacent to one of the previous vertices of S. (The correctness proof is a simple 
exercise.) If a PEO additionally satisfies (2) then we may put any new vertex 
into S if it is not adjacent to the last vertex added to S. This reduces the time 
bound to 0{n). We refer to this algorithm as the greedy algorithm. 




Efficient Dispersion Algorithms for Geometric Intersection Graphs 109 



3 Efficient Dispersion in Interval Graphs 

As a remarkable result of [6], one can answer distance queries in an interval graph 
in constant time, after an 0(n) time preprocessing. Actually, the result holds on 
circular-arc graphs, and preprocessing may be done in parallel, on 0(n/logn) 
CREW PRAM processors in log n time. (We also mention that a weaker version 
with O(nlogn) preprocessing time and O(logn) answer time is easily obtained 
by standard doubling techniques.) 

Our second prerequisite addresses common PEOs of interval graphs. Common 
PEOs of chordal powers of graphs have been studied in [3,4], and the following 
lemma can be derived from these papers, however it is more convenient to give 
a short self-contained proof. 

Lemma 1. A sorted interval family is a common PEO of all powers of the 
represented interval graph. 

Proof. Consider vertices u < v, u < w with d{u, v) < k and d{u, w) < k. We 
have to show d{v,w) < k. So let P be a shortest path from u to v, and Q a 
shortest path from u to w. These paths are 1-shaped. 

Let v' and w' be the first vertex on P and Q, respectively, after u. Due to (1), 
v'w' is an edge. If u' denotes the leftmost vertex of v' , w' then we have 1-shaped 
paths from u' to v and w, of length k — 1 and k, or vice versa. By an inductive 
argument we eventually obtain 1-shaped paths from some vertex a; to u and w, 
such that one of them, w.l.o.g. to v, has length 1 or 0. More precisely, we either 
have V < X and vx is an edge, or x = v. 

In case x = v we are done, since d{x,w) < k. If d{v,x) = 1 and d{x,w) < k, 
the assertion is also proven. Assume that d{v, x) = 1 and still d{x, w) = k. By 
construction, this implies x < w' . But since u < v < x < w', property (0) yields 
that vw' is an edge. Thus d{v, w) < k. □ 

In the following we assume that the interval graph is already given as a 
sorted interval family I. If I is not sorted yet, we have to spend 0(n log n) time 
in general, but note that there are natural cases where faster sorting is possible. 
For example, if all endpoints are integers within a range of s, then I can be 
sorted in 0{n + s) time by bucketing, which is 0(n) if s = 0(ji). 

Lemma 1 immediately yields a simple and efficient dispersion algorithm: 

Theorem 1. A q-dispersion set in an interval graph can he found in time 
0{nlog{n/q^)) , if the graph is given as a sorted interval family. 

Proof. In 0{n) time, set up the data structure supporting constant-time distance 
queries. Let s denote the distance between the first vertex u and the last vertex 
v in the given PEO of G. We may obtain s by a distance query. One may also 
compute a shortest path in 0(n) time in a straightforward way, but the algorithm 
only needs the number s. 

Any shortest path {u = xq, xi, . . . ,Xg = v) is monotone in the PEO, that is 
xo < xi < ...Xg. It partitions the PEO into s consecutive blocks of vertices. 
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where the ith block contains the vertices succeeding Xi_i up to Xj. (Vertex xq is 
added to the first block.) By (1) and (2), every vertex in the fth block is adjacent 
to Xi- Hence, two vertices in the ith and jth block have distance at most k + 2, 
where k = \j — i\. On the other hand, their distance it at least k — 2, otherwise 
there would be a path from Xi to Xj shorter than k. 

We easily conclude that the cardinality qk of a maximum fc-independent set 
in G satisfies 

s — 1 
k + 3 ~ 

This can be written as 



qk - 1 



s — 1 . s — 1 



Remember that we search for the maximum k with qk > q- By maximality of 
k we have qk+i < q- Since the left-hand inequality holds similarly for fc -|- 1, we 
finally get 



s — 1 s — 1 

4 < fc < -k 1. 

q q-l 



The difference is 0{s/q^) = 0{n/q^). Thus it suffices to search for k among 
0{n/q^) candidates. Since qu decreases monotone with k, we can use binary 
search. 

Due to Lemma 1, for computing a maximum /c-independent set in G we 
may apply the 0{n) time greedy algorithm to the given PEO, with the only 
modification that adjacency queries to G^ are replaced with distance queries to 
G. □ 



In particular, the time bound is 0(n) if <7 = Q{y/n), but only 0(n log n) 
for smaller q. For an acceleration we have to exploit more structure of interval 
graphs. 



First note that the fc-independent sets computed by the greedy algorithm 
(in the fc-the power) consist of (inclusion-)minimal intervals only: Assume that 
the greedy algorithm puts u and next some non-minimal interval v into S. Since 
V includes some minimal interval w, and w precedes v in the ordering, w is 
considered before v. But d{u,v) > k obviously implies d{u,w) > k, hence w 
would be chosen, contradiction. Similarly, the first interval in S' is a minimal 
one. 

We conclude that the following modified greedy algorithm outputs a maxi- 
mum fc-independent set in 0(n) time: Mark all minimal intervals. Scan the PEO 
from left to right. Starting with S = 0, add a vertex to S if it is not adjacent to 
the last vertex added to S AND it is a minimal interval. 

We can also formulate: 

Lemma 2 . If u < v < w, w is a minimal interval, and d{u, w) < k then 
d{u, v) < k. 
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Proof. Let P be a shortest path from u to w. Since some edge of P bridges v, 
by (2), V is adjacent to some endvertex of that edge. This gives immediately 
d{u, v) < k, unless the bridging edge is the last one of P, and v is adjacent to w 
only. But then interval w includes v, contradiction. □ 

A block in an ordered interval family / is a set of consecutive elements, with 
respect to the ordering. 

Lemma 3. The vertices in a block of size b have mutual distances at most 2b. 

Proof. It suffices to show that two consecutive vertices u < v have distance at 
most 2. As we know, any shortest path P from t6 to u is 1-shaped. Let w be the 
second vertex on P. Since no vertex is between u and w, we have v < w. By (2), 
uv or vw is an edge, or w = v. □ 

Now we are ready to present our fast dispersion algorithm. 

Theorem 2. A q-dispersion set in an interval graph can be found in time 
0(n log log n), if the graph is given as a sorted interval family. 

Proof. Mark the minimal intervals. This can be done in 0{n) time in a straight- 
forward way. Set up the data structure for constant time distance queries, in 
0(n) time. Partition / into blocks of approximately logn vertices each. For fixed 
k we determine a maximum fc-independent set in the following way. 

li k < 2 log n then apply the simple 0(n) time greedy algorithm. 

If k > 2 log n then build a set S as the greedy algorithm does, but consider 
only the minimal intervals, and check only the first minimal interval v in every 
block. If the distance of v to the currently last element t6 G S' is at most k then 
this is also true for all vertices in the preceding block, by Lemma 2. If d(u, v) > k 
then search the preceding block plus v for the leftmost minimal interval having 
distance larger than k to u. Put this vertex into S to be the new u. By Lemma 
2 again, it suffices to perform binary search in the block, rather than testing all 
candidates. Since, by Lemma 3, the distances within the block are bounded by 
2 log n < k, no further vertex from the same block can be added to S. Thus we 
need no more than O(loglogn) distance queries in each of the n/logn blocks. 

Finally, we have to search for the maximum k which admits a /c-independent 
set of size q. For k < 2 logn we need O(loglogn) steps of binary search, each 
of duration 0{n). The O(logn) steps with k > 2 logn need 0(n log log n/logn) 
time each. □ 

It seems that we cannot extend the result to strongly chordal graphs, a class 
including both trees and interval graphs and being closed under graph powers. 
Even if a strong elimination ordering is already given, it is unlikely that a com- 
mon PEO of all powers can be efficiently constructed. (The results of [4] discou- 
rage such a conjecture.) In particular, a strong elimination ordering itself does 
not have this property in general; this is wrong even for trees. Moreover, a sub- 
quadratic preprocessing that supports fast distance queries in strongly chordal 
graphs is missing. 
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4 Extension to Circular-Arc Graphs 

Similarly, the results of [6], [9], and [11] can be combined to an o(n^) dispersion 
algorithm for circular-arc graphs, provided that the graph is given as a circular- 
arc family, sorted by their right endpoints. The “left” and “right” endpoint of 
an arc is understood with respect to the clockwise orientation. In this section 
we outline such an algorithm. 

As shown in [11], a certain extension of the greedy algorithm on interval gra- 
phs computes a maximum independent set in a circular-arc graph: First start on 
an arbitrary vertex u. Scan the circle clockwise and find the next right endpoint 
of an arc v being not adjacent to u. Continue with u := v, and so on. Mark every 
visited vertex. As soon as some vertex u is visited the second time, the process 
runs into a “cycle” from which it is quite easy to select a maximum independent 
set. (The principle was independently discovered also in [12,15].) 

A suitable implementation of this algorithm via some auxiliary directed graph 
runs in 0(n) time. It is important to notice that the greedy algorithm uses two 
things only: the circular ordering of vertices with respect to right endpoints of 
the representing arcs, and adjacency queries. 

In [9], the following has been proven: 

Lemma 4. If is a circular-arc graph, and k = 1 or diam{G^) > 4, then 
is also a circular-arc graph. Moreover, an arc representation of is 

obtained from one of G^ by extending the arcs in one direction only, w.l.o.g. 
counterclockwise. 

It follows that all powers G^ of a circular-arc graph G with k = 2 or k < 
diam{G) /3 -I- 1 have a common circular ordering of the right endpoints. Hence 
we can find, for any such k, a maximum /c-independent set in 0(n) time: Apply 
the greedy algorithm to the given circular ordering, while replacing adjacency 
queries to G^ with distance queries to G. 

Altogether we obtain: 

Theorem 3. A q-dispersion set in a circular-arc graph can be found in time 
0(n log n), if the graph is given as an arc family, and q >7. 

Proof. Let A be the given arc family with intersection graph G. Check in 0(n) 
time whether A covers the entire circle (otherwise G is an interval graph, and we 
are done). Let c be the minimum number of arcs from A which cover the whole 
circle. It is not hard to see that diam{G) > [c/2j . 

Search for the largest k that admits a fc-independent set of size at least q. 
If we consider some k > max{3, <izam(G)/3 -I- 1} during our search, we have 
k > (c — I)/6-|- I. Since there exists a circle cover of size c, any fc-independent 
set in G has at most c/(fc — 1) vertices. Trivially, this is at most c/2, and for 
c > 13, we have c/(fc — 1) < 6c/(c — 1) < 6.5. In either case, these are less 
than q vertices. Therefore, also the greedy algorithm applied to G* (which is not 
guaranteed to work correctly for such fc!) cannot supply a fc-independent set of 
size q. On the other hand, it works correctly for fc < diam{G)/3-\-l, as discussed 
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above. Hence we can use binary search to find the best k, which implies the time 
bound. □ 

Thus we have a very short proof of the O(nlogn) bound, but annoyingly, 
this approach fails for g < 6. In order to close this gap, we have to make an 
additional effort and generalize the maximum independent set algorithm itself. 
We follow the lines of [11]. 

Given an arc family A, an arc in A is called a min-arc if it includes no other 
arc from A. The min-arcs in A form a cyclic ordering in an obvious way, w.l.o.g. 
clockwise. If A is already sorted, one can recognize the min-arcs and construct 
this ordering in 0(n) time. For any positive integer k, trivially, there exists a 
maximum fc-independent set in the intersection graph G of A that consists of 
min-arcs only. Hence it suffices to search for such fc-independent sets. 

For a min-arc v we define NEXT}^{v) to be the first min-arc w following v in 
the cyclic ordering, such that d{v,w) > k. Note that w does not always exists, 
in that case NEXTk{v) is undefined. Define a directed graph Dk whose vertices 
are the min-arcs of A, with directed edges {v, NEXTk{v)). Note that any vertex 
of Dk has at most one outgoing edge. A moment of thinking reveals that any 
vertex having an ingoing edge must also have an outgoing edge. It follows that 
any path starting in a non-isolated vertex leads to a cycle (not in a dead end). 

For a min-arc x let C{x) denote the maximal path in Dk starting with x, 
such that all y in C{x) satisfy d{x,y) > k. 

Lemma 5. The sequence C{x) is embedded into the cyclic ordering of min-arcs, 
and the vertices of C{x) form a k-independent set, 

Proof. For the first assertion, note that no directed edge (of Dk) in C{x) can 
jump over x in the circle, otherwise we had a contradiction to the definition of 

Dk- 

Assume that C{x) is not /c-independent. Then there exist vertices u,v being 
non-neighbors in C{x) with d{u,v) < k. Removal of arcs u and v divides the 
circle in two parts. (In the following assume that u and v do not intersect, that 
is d{u,v) > 1. However case d{u,v) = 1 can be settled similarly.) Consider a 
shortest path P connecting u and v. The union of arcs in P must cover at least 
one of the two mentioned parts completely. Since u, v are non-neighbors in C{x), 
w.l.o.g. w = NEXTkiu) is between u and v in this part and is therefore adjacent 
to some vertex of P. This contradicts d{u, w) > k, unless v is the only vertex 
of P adjacent to w. But this is also impossible, as u is a min-arc, and w is 
between u and u on a segment covered by P. Thus we have shown that C{x) is 
fc-independent . □ 

Lemma 6. C{x) is a largest k-independent set containing x. 

Proof. C{x) uniquely partitions the cyclic ordering of min-arcs into segments, 
each beginning with the next member of C{x). Assume that there is a larger 
fc-independent set C' 9 x, consisting of min-arcs only. Some of the mentioned 
segments beginning with, say, u G C{x) must contain two vertices v,w € C' , 
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with V before w, thus u^w. By construction of C{x) we have d{u,w) < fc. If a 
shortest path from u to w covers v then we get a contradiction to d{v,w) > k, 
as in the previous proof. Hence a shortest path P from m to w must cover x 
(including case u = x ai the moment). Since x,w € C' we have d{x,w) > k 
which implies u ^ x, and u is the only vertex of P adjacent to x. Again this 
contradicts the fact that m is a min-arc. □ 

If X has an outgoing edge (x,y) then C{y) has at least as many vertices 
as C{x): Note that C(x) contains y, and C{y) is a largest fc-independent set 
containing y. In particular, for all x within a cycle of Df^, the C{x) have equal 
size, and some cycle must consist of vertices x such that every C{x) is a maximum 
/c-independent set. Using constant-time distance queries in G, it is now rather 
straightforward to conclude: 

Theorem 4. A maximum k-independent set in a circular-arc graph can he fo- 
und in 0{n) time, provided that the graph is given as a sorted arc family. Con- 
sequently, a q-dispersion set can he found in 0(n log n) time. 

Similarly as in the interval graph case, we can improve this time bound for 
large enough q. 

Theorem 5. A q-dispersion set in a circular-arc graph can he found in time 
0{nlog{n/q^)), if the graph is given as a sorted arc family. 

Proof. Compute a greedy covering of the circle in 0(n) time. That is, start in 
an arbitrary point and extend the covered segment counterclockwise, as much 
as possible, by adding a new arc, until the whole circle is covered. Let c denote 
the number of arcs in this greedy covering. Any fc-independent set has at least 
[c/{k -\- 1)J and at most c/(fc — 1) vertices. Given q, it suffices to search for the 
maximum fc in a range of O^cfq"^) values. □ 

Here it is left open whether the O(nloglogn) technique for interval graphs 
can be extended to circular-arc graphs. The difficulty is to find, in sublinear 
time, some x in a cycle of Dk which yields maximum C{x). 

5 Dispersion in Weighted Trapezoid Graphs 

For permutation graphs, an 0(n) time preprocessing that supports answering 
distance queries in constant time is known [5]. In trapezoid graphs one has 
O(nlogn) preprocessing time and 0(log/c) answer time if the distance is k [10]. 
Permutation graphs are contained in the class of trapezoid graphs which is closed 
under powers [8], and maximum weight independent sets in trapezoid graphs 
can be found in O(nlogn) time [7]. (The algorithm in [13] is not useful for our 
purpose, as it is linear in the number of vertices and edges which may be 0{n^).) 

However this does not lead to an o(n^) dispersion algorithm in the same way 
as in the previous sections, because things do not go well together. Remember 
that in previous cases (i) powers preserve some common vertex ordering, and (ii) 
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the independent set algorithm used relies on this ordering and adjacency queries 
in only (which are realized by distance queries in G). The algorithm in [7] 
violates (ii), since it explicitly uses the coordinates in a trapezoid representation 
(or equivalently, in a box representation) of the graph. 

Nevertheless we obtain an 0(nlog^ n) time dispersion algorithm in weighted 
trapezoid graphs in another way, without using fast distance queries. The key is 
the following lemma being implicit in [10]. 

Lemma 7. A trapezoid representation of G^ can be constructed in 0(n log n) 
time from a trapezoid representation of G. 

Together with an 0{n log n) time algorithm for maximum weight independent 
sets in trapezoid graphs [7], we get immediately: 

Theorem 6. The dispersion problem for trapezoid graphs with positive vertex 
weights can be solved in O(nlog^n) time, if the graph is given as a trapezoid 
family. 
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Abstract. We examine a network upgrade problem for cost flows. 
A budget can be distributed among the arcs of the network. An in- 
vestment on a single arc can be used either to decrease the arc flow cost, 
or to increase the arc capacity, or both. The goal is to maximize the flow 
through the network while not exceeding bounds on the budget and on 
the total flow cost. 

The problems are NP-hard even on series-parallel graphs. We provide an 
approximation algorithm on series-parallel graphs which, for arbitrary 
5, e > 0, produces a solution which exceeds the bounds on the budget 
and the flow cost by factors H-5 and H-e, respectively, while the amount 
of flow is at least that of an optimum solution. The running time of the 
algorithm is polynomial in the input size and l/[5e). 



1 Introduction and Related Work 

Weighted graphs can be used to model a wide range of problems, e. g. in the 
area of transportation, logistics, and telecommunication. While the underlying 
instances are static, realistic applications often admit local improvements in- 
stead of re-implementing the whole network. This leads to the area of network 
upgrade problems. All these problems have in common that investing a budget 
on parts of the graph allows to change weightings on the graph elements while 
the topological structure of the graph remains untouched. 

Popular models in the area of network upgrade problems are the node upgrade 
model, where upgrading a node decreases the length of all incident edges [11,4,5], 
and the edge upgrade model, where upgrading an edge decreases its length [10,6]. 
In [7] the authors investigate a cost flow problem where a budget invested on the 
arcs can be used to increase the capacity of that arcs. Given a flow value F, the 
goal is to compute an upgrade strategy of minimum cost such that the resulting 
network admits a flow of at least F. The authors give an FPAS on series-parallel 
graphs. 

In this paper we investigate a cost flow problem where the budget can be 
used to decrease the unit flow costs or to increase the capacity of arcs indepen- 
dently. This model is motivated e. g. in the case of a transportation company 
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specialized on perishable food. Upgrading the capacity, i.e., using larger trans- 
portation containers, involves even the use of larger cooling aggregates such that 
the transportation costs per unit of food are not affected. On the other hand, 
by investing in the infrastructure one can lower the unit costs without affecting 
capacities. 

We provide hardness results on series-parallel graphs (Section 3) and give an 
approximation algorithm in the spirit of an FPAS (Section 4,5). In Section 6, 
we evolve a combined problem — decreasing flow costs and increasing capacities 
simultaneously — which turns out to be solvable within essentially the same run- 
ning time. 

2 Preliminaries and Problem Formulation 

A flow cost problem is defined by a directed graph G = {V,R) with arc capaci- 
ties u and arc costs c. An arc function x is called a (feasible) flow from s to t 
(s, t G V), if 0 < a; < u for each arc, and for each node v G U \ {s, t} the inflow 
equals the outflow. The flow value F{x) is given by the net outflow of node s. 
The cost of a flow x is given by 

We model a scenario where the flow costs can be decreased by investing a 
budget on the arcs. To this end, we are given a price function p and a discount 
function d on the set of arcs. An investment on the graph is described by an 
arc function y with the following meaning: An investment of y(r) units on arc r 
costs y{r)p{r), while the flow cost reduce from c{r)x{r) to (c(r) — y{r)d{r))x{r). 
An additional arc function Cmi„ specifies a lower bound on the flow costs. 

Naturally, we have the following restrictions on the functions: u, c > 0, c > 
Cmin, and 0 < d < c — Cmin- Thesc restrictions are without loss of generality: 
The case c(r) = 0 can be modeled by setting c(r) = d(r) = 1 and p(r) = 0. 
If c(r) = Cmin(r), this means that the flow costs can not be lowered; this can 
be modeled by setting p(r) to some large constant which exceeds the available 
budget. The same applies to the case d(r) = 0. 

Definition 1 (FCLP). An instance of the flow cost lowering problem (FCLP) 
is given by a directed graph G = (U, i?) with terminals s,t G V and functions 
u, c, Cmin,P, d\ R ^ IMo restricted as noted above. Further, we are given constants 
S, C G IM as limits on the upgrade budget and the flow cost, respectively. 

The goal is to find a solution a = {x, y), specified by a feasible flow x and an 
upgrade strategy y on the arcs, such that 

1. (upgrade investment) B{u) := '^r&jiy{'f’)p{r) < B, 

2. (flow cost) G(a) := J^^^jf(c(r) - y(r)d(r))x(r) < G, 

3. (flow value) F{u) := F(x) is maximized. 

Using a canonical notation for multi-criteria optimization problems [8] , FCLP 
is denoted by (flow value: max, upgrade investment: upper bound, flow cost: 
upper bound). A solution a is called feasible, if B{a) < B and G{a) < G. An 
approximation algorithm with performance (oi, 02 , 03 ) is a polynomial running 
time algorithm, which returns an almost feasible solution (i.e., a solution with 
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B{a) < a 2 ■ B and C(ct) < «3 • C) with F{a) > l/a\ ■ F{a*), where a* is an 
optimal feasible solution. If there is no feasible solution, the algorithm is free to 
provide this information or to return any almost feasible solution. 

For multi-criteria optimization problems, by interchanging bounds and opti- 
mization objective one gets a family of related problems. Results on the hardness 
carry over to the related problems. Moreover, for integral objectives as in the 
case of FCLP, even approximability factors carry over to related problems [12,9, 
8,1]. For bicriteria problems, related problems are also known as dual problems. 

A series-parallel graph is defined by the following rules: The single arc (s, f) 
is a series-parallel graph with two terminals, the source s and the target t. For 
series-parallel graphs Gi, G 2 with sources si, S 2 and targets t\,t 2 , respectively, a 
new series-parallel graph with source si and target t 2 can be obtained by either 
identifying t\ = S 2 (series composition), or identifying si = S 2 and t\ = t 2 
(parallel composition) . 

For graph G = {V, R), we use the abbreviations n := \V\ and m := |i?| where 
appropriate. 



3 Hardness Results 

In this section we show the hardness of FCLP and provide lower bounds on the 
approximability. 

Theorem 1 (Non-approximability of FCLP). For any (polynomial time 
computable) function a{n), the existence of an {1,1, a{n))~ approximation algo- 
rithm for FCLP on series-parallel graphs with n nodes implies P = NP. 

Proof. We show the hardness by a reduction from Knapsack [3, Problem MP9]. 
An instance of Knapsack is given by a finite set A = {oi, . . . ,Ofc} of items, 
each of weight w{ai) > 0 and value u(oj) > 0, and two numbers W,V € IN. It is 
NP-complete to decide whether there is a subset A' C A such that w(A') < W 
and v(A') > V. 

Assume there is a (1, 1, a)-approximation algorithm for FCLP. Given an in- 
stance of Knapsack, we construct a graph with vertex set {s, t} joined by k par- 
allel arcs. For item Oj, arc has capacity u{ri) := v{ai), and price p{ri) := w{ai). 
Further, for all arcs set the initial flow cost c(r) := a ■ V -\- 1, the minimal flow 
cost Cmin{r) ■= 1, and the discount d{r) := a ■ V. Finally, set the flow cost 
constraint G := V, and the budget constraint B := W. 

We claim that there is a solution of the FCLP instance with flow value at 
least V if and only if there is a solution to the Knapsack instance. 

Assume that there is a solution of FCLP with flow value F >V obeying both 
constraints. Let R' be the set of upgraded arcs, choose A' to be the corresponding 
set of items. Due to the flow cost constraint it is easy to see that the solution 
of FCLP uses only upgraded arcs. Then X^a'eA' ) = Er'eR' >B>V 
and X^o'gA' = Er'^R' B = W, therefore A' is a solution for 

Knapsack. Conversely, constructing a solution for FCLP out of a solution of 
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Knapsack is easily achieved by upgrading the arcs corresponding to the items 
of the solution. 

Since the flow cost of an arc r which is not upgraded exceeds a ■ V , any 
(1, 1, a)-approximation algorithm for FCLP must in fact solve the underlying 
Knapsack problem exactly. □ 



Theorem 2 (Non-approximability of FCLP). For any a < ln(n/2 — 1), the 
existence of a {1, a, 1)- approximation algorithm for FCLP on bipartite graphs 
with n nodes implies NP C 

Proof. The proof uses a reduction from Dominating Set (DS) [3, Problem 
GT2]. An instance of DS is given by a graph G = (V, E) and a number AT G IN. 
A subset y' C K is called dominating, if each node in V is either contained in V' 
or incident to a node from V' . It is NP-complete to decide whether G admits a 
dominating set of size at most K. 

Given an instance G = (V, E) of DS, construct a graph G' with node set 
V{ U 1^2 U {s, t}, where each Vf is a copy of V. Insert an arc between v £ V{ and 
w £ Vf if and only if w is dominated by v in the original graph. For each v £ V{ 
insert an arc (s,v) of cost c := 2, discount d := 1, minimum cost Cmm := 1 and 
price p := 1. For each w £ Vf add an arc (w, t) of capacity u := 1. If not specified 
yet, set the capacity on the remaining arcs to u := n, and the costs and prices 
to zero. Notice that the resulting graph is bipartite, and only the arcs incident 
to s cause flow costs. 

It is easy to see that a solution for FGLP with budget constraint B, flow cost 
constraint n, and flow value n implies a dominating set in the original graph of 
size at most B. Gonversely, a dominating set of size B implies the existence of a 
solution of FGLP for budget constraint B and flow cost constraint n which has 
flow value n. 

Assume that there is a (1, a, l)-approximation algorithm for FGLP. By trying 
all n instances with budget constraint B = 1, . . . ,n and flow cost constraint n 
and taking the best solution, we get an a-approximation algorithm for Minimum 
Dominating Set. Using the result of Feige [2] and the fact that the number of 
nodes in the constructed graph satisfies n' = 2n + 2, we can conclude that a 
(l,ln(n/2 — 1), l)-approximation algorithm for FGLP on bipartite graphs with 
n nodes implies NP C DTIME(A^O(i°8'°s^)). □ 

4 Algorithm for FCLP on Series-Parallel Graphs 

In this section we provide a pseudopolynomial time algorithm which solves FGLP 
on series-parallel graphs exactly. The main idea is to And a distribution of the 
given budget and flow cost on the arcs such that the resulting flow value is 
maximized. 

To this end, we define 



OPTg(/3,7) 
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to be the flow value F{a*) of an optimal solution a* for FCLP with budget 
limit /3 and cost limit 7 on graph G. We give an algorithm 



A{G, u, c, d,p; B, G) 



which computes the value OPTg(_B, G) and the corresponding upgrade strategy 
and flow. 

First we compute a decomposition tree of the series-parallel graph G. Schoen- 
makers has shown that in time 0{n + m) one can compute such a decomposition 
tree or give the information that the graph is not series-parallel [13]. Then, an 
optimal solution can be built by traversing the tree bottom-up. Notice that the 
size of the decomposition tree is in 0{m) C 0{n). 

For a single arc r, the numbers OPTj.(/3, 7 ) for all values 0 < P < B and 
0 < 7 < C can be determined in total running time 0{BC): Since 



P 



[p{r) 



y(r) = min 

it follows that 

OPTj.(/3, 7 ) = x{r) = min 



C(r) - Cmin(r) 

d{r) 



c{r) - d{r)y{r) 



]■ 



,u{r) 






Notice that the above expressions yield finite numbers even if one of the deno- 
minators equals zero. 

If G is the series composition of two subgraphs G\ and G 2 , then 



OPTg(/ 3,7) = min{OPTGi(z,j),OPTG,(/3-i,7-j)}. 

0<i</3 

0<1<7 

If G is the parallel composition of G\ and G 2 , then 



OPTg(/ 3,7) = OPTGl(^,J)-kOPTG2(/3-^,7-J)• 

o<^<p 

0<j<7 



Assuming that the numbers OPTG^(i,j) on the subgraphs are already computed 
for all values 0 < i < P and 0 < j < 7 , the running time for calculating 
OPTg(/3, 7 ) is in 0{P^) in both cases. Since the decomposition tree of G consists 
of 0{m) nodes, the total running time for computing number OPTg(B, G) (and 
simultaneously keeping track of the corresponding flow and upgrade strategy) is 
in 0{mB'^G^). 

5 Polynomial Time Approximation Algorithm 

In this section we use a scaling technique derived from [7] to achieve a polynomial 
time approximation algorithm for FCLP with performance guarantee (1, 1-1-5, 1-1- 
e) for arbitrary 5, £ > 0. 




Optimizing Cost Flows by Modifying Arc Costs and Capacities 121 



The running time of the algorithm presented in the previous section is in 
0{mB‘^C^) when called with bounds B and C on the investment budget and 
total flow cost. In order to achieve a solution a = (x, y) in polynomial time we 
scale the bounds to polynomial size. To this end it is also necessary to scale the 
costs and prices on the arcs by appropriate factors. The budget and cost value 
of (7 on the original (unsealed) instance exceed the given bounds by small factors 
due to the rounding errors arising from the scaling process. In exchange to this 
inaccuracy the resulting flow value is at least the optimum flow value. 

As a preprocessing step, we first set up 



x{r) := u{r) and y{r) 



c(r) - Cniin(r-) 
d{r) 



and Cr{r) := c{r)x{r)y{r) , dr{r) := d{r)x{r)y{r) , and Pr('c) := p{r)y{r). As we 
will see later, this modification of the scaling technique from [7] helps to make 
the rounding error on each arc independent of the size of investment and flow 
on an arc. 

Algorithm A is modified as follows. To compute the value OPTr(/3, 7) for a 
single arc r, it computes 



y{r) 



= min 



{ 



fjyjr) 

_Pr{r ) . 




( 1 ) 



and hence 



0PT,(/3,7) 




7 x(r)j/(r) 
Cr{r) - dr{r)y{r) 




(2) 



The scaling is performed as follows. Given <5, £ > 0, choose e' := 3/4 • £ and 



set 




Pr = 



Cr r ZmCr ’ 

Cei — Ce' 



dr r 3mdr ' 

Ce' Me' 



I Pr 

[(l + i)3mjy- 

rSmprl 

I BS I 



if C yf 0, 
otherwise, 

if C yf 0, 
otherwise, 
if B = 0, 

1 Ap>BAB^0, 
otherwise. 



Notice that ,d'^ > 0 , since m,Cr,dr,e' > 0. Moreover, p^ = 0 
0, since m,S > 0. Algorithm A is then called with parameter list 

A{G,x,y,c^,d^,pf] [(l + l/(5) 3mJ , [(1 + l/£') 3mJ) . 



Pr = 



Notice that the running time of the algorithm is now in 0{m^{l + j)^(l + 

The following lemma shows that the solution exceeds the given bounds on 
the investment budget and the flow cost only by small factors. 
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Lemma 1. For any solution a returned by the algorithm A, we have B{a) < 
(1 + 5)B and C{a) < (1 + e)C. 



Proof. We have 






B6 ^ pf 



y 



3m 



reR 



reR reR 



On the other hand, 



C{a) = E < E ^ 



r£R r&R 

Ce' X — d^ xy xy 



3m 

xy 



< 



3m 

Ce' 

3m 



E 

reR 



xy 



xy 



^ 3m + m^ = C = (1 + e)C . 



At this point, we profit from the preprocessing step, since the rounding error 
per arc, xy/xy < 1 , is independent of y. □ 



It remains to show that the overall flow value is at least the flow value of an 
optimum solution. 



Lemma 2. For any solution a returned by the algorithm A, we have F{u) > 
OPTg{B,C). 

Proof. Let a* = (x*,y*) be an optimal solution of the original (unsealed) in- 
stance. (Notice that FCLP always has a feasible solution, e.g. (a; = 0,y = 0).) 
We show that there is a solution a = (x, y) on the scaled instance with x > x* 
and y > y*. Since the algorithm computes all possible distributions of the bud- 
get and the flow costs, it suffices to give one appropriate distribution (/3, 7 ) of 
budget and flow cost. The detailed proof is of rather technical nature and can 
be found in the appendix. □ 



The following theorem summarizes the results of the current section. 

Theorem 3 (Approximability of FCLP). For any 5, e > 0, there is an ap- 
proximation algorithm for FCLP with performance (1, 1 -I- i5, 1 -I- e) . Its running 
time is in 0(m^(l -I- j)^(l + ^)^)- D 



6 Extension to Augmentable Arc Capacities 

In this section we extend FCLP to cover a situation where the invested budget 
can also be used to augment the capacities of the arcs. The resulting problem is 
called flow cost lowering capacity augmenting problem (FCLCAP). 
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An instance of FCLCAP is given by a directed graph G = {V,R) with ter- 
minals s,t € V and functions u, Umax, c, CmimPd, d,pa, a: R ^ INq. The meaning 
of the parameter functions is similar to those given in Definition 1. Wmax speci- 
fies an upper bound beyond which the capacity of an arc can not be increased. 
Analogously to the function pair pd/d, the function pa specifies the price for 
augmenting the arc capacity by an amount of a. Note that the available budget 
can be split arbitrarily to decrease the flow cost or increase the capacity. 

The goal is to And a solution a = (x, pa, yd), specifying a feasible flow x and 
investment strategies ya,Vd for increasing capacities and lowering flow costs, 
respectively, such that 

1. (investment) B{a) := J2r(^Rya{r)pa{r) + yd{r)pd{r) < B, 

2. (flow cost) C'(cr) := Erefl(c(^) “ yd{r)d{r))x{r) < C, 

3. (flow value) F{a) := F{x) is maximized. 

Notice that the condition on the feasibility of x is changed to x{r) < u{r) + 
ya{r) -a(r). The canonical notation of FCLCAP is (flow value: max, upgrade and 
augmentation investment: upper hound, flow cost: upper hound). 

It is easy to see that FCLCAP contains FCLP as a special case. Therefore all 
hardness and non-approximability results immediately carry over to FCLCAP. 



Theorem 4 (Approximability of FCLCAP). For any 6,e > 0, there is an 
approximation algorithm for FCLCAP with performance (1, 1 -I- i5, 1 -I- e). Its 
running time is in + j)^(l + Jj)^)- 

Proof. The proof is similar to the proof given in Section 5. The main difference 
consists in the behavior of the underlying pseudopolynomial algorithm A. The 
computation of OPTr(/ 3 , 7 ) for a leaf r of the decomposition tree is modified as 
follows: With 



ya{r) = min 
yd{r) = min 

we have 

OPTi.(/3, 7 ) = x(r) = max min 
pa+yd=y 



fla 

_Pa{r)_ 

fld 

_Pd{r)_ 



Mmax(r) - u{r) 
a{r) 

c(r) - Cmin(r) 
d{r) 



1 } 

}■ 



and 



_c(r) - d{r)yd{r)j 



,u{r) + a{r)ya{r) 



The computations for interior nodes of the decomposition tree are exactly the 
same as described above. 

After changing the definition of x to 



x{r) 



•Umax(r-) - u{r) 
a{r) 



a{r) + u{r ) , 



the scaling technique outlined in the previous section can be applied to FCLCAP 
straightforward. This leads to the stated result. □ 
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7 Conclusions and Further Remarks 

We have given approximation algorithms for FCLP and FCLCAP with perfor- 
mance (1, 1 -|- 5, 1 + s) for arbitrary S,e>0 and running time polynomial both 
in the input size and in l/(Se). Analog results we could show for the related 
problems with interchanged optimization objectives. 

On the other hand, we have shown that (l,l,a(n)) is a lower bound on 
the approximability of FCLP and FCLCAP on series-parallel graphs for any 
(polynomial time computable) function a{n). A lower bound of (1, ln(n/2 — 1), 1) 
could be shown to hold on bipartite graphs, where n denotes the number of 
nodes in the graph. All results rely on the common conditions P yf NP or 
NP 2 DTIME(Ad^(^°s'°8^)), respectively. 

In realistic applications often problems arise where upgrading of an arc is not 
performed continuously but in one single step. There are examples where this is 
true for arc capacities as well as for arc flow costs. In our model, these cases are 
handled by restricting ya or yd to take on values only from {0, 1} by setting Mmax 
and Cmin appropriately. Therefore the approximation results also carry over to 
those 0/1-cases. 
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8 Appendix 



Proof (of Lemma 2). For shorter notation we omit parameter r, i. e. we write / 
instead of /(r) for all functions / where there are no ambiguities. Assume 



/3(r) := 

I y I 

for each arc r G R. Clearly, y > y*. On the other hand, if > 0, 



Pr * 

—y 

V 



(iff 

p? 



^-ly* 

y ^ 
Pr 



y 



> 



^y* 

ii ^ 

-By 

Pr 



= y ■ 



Hence, by (1) it follows y > y*. 
Assume 



7 (r) := 



X* (cp -dffy*) 

xy 

for each r G R. Clearly, x > x*. Moreover, 







x*{cf-dfy‘) 


xy 


v>y* 

> 




jxy 




xy 


xy ^y 


- d^y_ 




- d(ty 




cp — d^y* 



= X 



By (2), we have x > x*. Since the source s has no incoming arcs, this implies 
F{a) > F{a*) = OPTg{B, C). 

It remains to show that ct is a feasible solution on the scaled instance. 



reR 



p? * 

—y 

y 



r&R r&R 

PAO PAO 

3mpr I 1 \ 

“ !/' + l =E 



reR 

PAO 



y 
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3m 
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reR 

PAO 



( 3mpry* y* 



E 

reR 

•PAO 



Pry 

By 



„ 3m 

-I- 2m < — ^ -I- 2m < 
0 



3m 

— ^ -I- 3m 

0 



(1 -I- y)3m 

0 
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On the other hand, 






r£R r&R 



x*c^ — d^x*y* 
xy 



< 



^R \ iy ) 



-E 

reR 



/ SmCrX* X* 3mdrX*y* 

\ Ce'xy xy Ce'xy 

2m 



+ 1 



3m 
<; 


i ^ c^x 


* - drX*y* 


)• 


“ s' 


1 ^ 

\reR 


Cxy 


3m 


+ 2m < 


3m 

— - + 3m 




s' 




- e' 





(1 + -:)3m 
e' 



This shows the claim. 
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Abstract. We introduce the notion of update networks to model com- 
munication networks with infinite duration. In our formalization we use 
bipartite hnite graphs and game-theoretic terminology as an underlying 
structure. For these networks we exhibit a simple routing procedure to 
update information throughout the nodes of the network. We also intro- 
duce an hierarchy for the class of all update networks and discuss the 
complexity of some natural problems. 



1 Introduction 

A network can be viewed as a finite directed graph whose nodes represent a type 
of system (e.g., processors, data repositories, and servers) and edges represent 
communication channels between the nodes. The process of communication in a 
network is essentially an infinite duration process. The interacting network has 
to be robust (e.g., be fault-tolerant and free of dead-locks). We are interested in 
those networks that possess liveness and fairness properties. Liveness means that 
all nodes are actively participating in the collective system. Fairness means that 
critical information of the dynamic network is distributed to all of the nodes. 

To give an example, suppose we have data stored on each node of a net- 
work and we want to continuously update all nodes with consistent data. For 
instance, we are interested in addressing redundancy issues in distributed da- 
tabases. Often one requirement is to share key information between all nodes 
of the distributed database. We can do this by having a data packet of current 
information continuously go through all nodes of the network. We call a network 
of this type an update network. 

The operation of an update packet over time enters only a finite number of 
nodes and produces an infinite sequence Vi,V2, ■ ■ called a run-time sequence. 
Since the number of nodes is finite, some of the nodes, called persistent nodes, ap- 
pear infinitely often in the run-time sequence. The success of a run-time sequence 
is determined by whether or not the sequence of nodes satisfies the liveness and 
fairness properties. That is, the set of persistent nodes coincides with the set of 
all nodes of the network. We can regard the run-time sequences as plays of a 
two-player game where one player, called Survivor, tries to ensure that persistent 
states span the network and the other player, called Adversary, does not. 

One possible model for an update network is thus based on a finite directed 
graph as the underlying structure for games between Survivor and Adversary. 
We formalize this in the following definition, where for this paper we restrict our 
attention to the bipartite case. 
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Definition 1. An update (bipartite) game is a bipartite graph G = (V, E) with 
partite sets S and A. There are two players, called Survivor and Adversary, 
which control vertices S and A, respectively. We stipulate that the out-degree of 
every vertex is nonzero. 

The game rules allow moves along edges of G. Each play of an infinite dura- 
tion game is a sequence of vertices vq,vi, . . . ,Vi, . . . such that the game rules are 
followed. We call a finite prefix sequence of a play a history. We say that a vertex 
V is visited if it occurs in the play. Note that either Survivor or Adversary may 
begin the play. Survivor wins a play if the persistent vertices of the play is the 
set V, otherwise Adversary wins. A strategy for Survivor is a function H from 
play histories vq, ... ,Vi, where Vi € S, to the set A such that {vi, H{vq, . . . ,Vi)) 
is an edge. A strategy for Advisory is similarly defined. 

A given strategy for a player may either win or lose the game when starting 
at an initial vertex Vq, where Uq G E. A player’s winning strategy for an initial 
vertex is one that wins no matter what the other player does. 

Example 1. In Figure 1 we present a game Q, where S = {1,3,4}, and A = 
{0, 2, 5}. Here Adversary has a winning strategy from any starting node. If whe- 
never node 0 is reached. Adversary moves to node 4. Thus, node 1 is only reached 
at most once during any play. 




Fig. 1. Example of an update game. 



We now formally define update networks. 

Definition 2. An update game is an update network if Survivor has a winning 
strategy for every initial vertex. 

Update networks can also be viewed as message passing networks where only 
Survivor’s nodes actively participate in the updating process, while Adversary’s 
nodes only need to passively forward a message. 

Example 2. The graph displayed below in Figure 2 is an update network. Here, 
no matter what Adversary does node 0 is visited. Survivor then can alternate 
moving between nodes 2 and 3 to span the network. 
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Fig. 2. A simple example of an update game which is an update network. 



We end this section with a few related references. Previous work on two- 
player infinite duration games on finite bipartite graphs is presented in the paper 
by McNaughton [ 2 ] and extended by Nerode et al. [ 3 ]. Our work focuses on a 
subclass of the games considered by these authors. Nerode et al. provide an 
algorithm for deciding McNaughton games. Their algorithm runs in exponential 
time of the graph size for certain inputs. For our games we provide two simple 
polynomial time algorithms for deciding update networks, partially based on the 
structural properties of the underlying graphs. We also note that several earlier 
papers have dealt with finite duration games on automata and graphs (e.g., see 
[ 4 , 5 ]). 

2 Constructing Update Networks 

We now want to present several construction techniques for building update 
networks from smaller update networks. We introduce three primitive operations 
below for update networks Gi = {S\, Ax,Ei) and G2 = (S'2,^2,G2). 

Enrichment: Take Gi and add edges E' to E\ where all new edges are directed 
from Si to A\. The resulting graph is called an enrichment of Gi, denoted 
Gi UU. 

Identification: Take Gi and G2 and subsets T\ of S'! and T2 of S2 with the 
same cardinality. Let / be a bijection between Ti and T2. Build the graph 
{Si U S'2 \ U, ^1 U A2, El U E) where E is E2 with all vertices in T2 replaced 
with their image under /. This graph is called the identification of Gi and 
G 2 under f, denoted by Gi ©/ G2. 

Extension: Take Gi and G2 then construct the graph {S = U S'2,A = 
2I1 U A2 , El U E2 U E) where E is a new set of edges directed from S' to A 
with at least one edge from Si to A2 and at least one edge from S2 to Ai. 
This graph is called the extension of Gi by G 2 via E, denoted Gi U G2 U E. 

It is easy to observe the following. 



Proposition 1. The enrichment, identification, extension operators applied to 
update networks yield update networks. 
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Example 3. The graphs displayed below in Figure 3 are obtained by applying 
the operations of enrichment, join and extension to the update networks B\ and 
i? 2 - The black and white vertices are Survivor and Adversary nodes, respectively. 




Fig. 3. Illustrating the operations of enrichment, join and extension. 



3 Deciding Update Networks 

In this section we provide an important operation, called the contraction ope- 
ration, that applied to update games produces update networks if and only if 
the original update game is an update network. The operation reduces the size 
of the underlying bipartite graph, and hence produces a decision procedure for 
checking if a given update game is an update network. 

We can easily characterize those bipartite update networks with only one 
Survivor vertex. These are the bipartite graphs where out-degree (s) = |A| for 
the single Survivor vertex s. We now mention several properties for all bipartite 
update networks [1]. 

Lemma 1. If {V = AuS,E) is a bipartite update network then for every vertex 
s € S there exists at least one a € A such that (a, s) € E and out-degree{a) = 1 

Proof. Let s be a vertex that does not satisfy the statement of the lemma. 
Let As = {a \ (a, s) G E}. Then out-degree(a) > 1 for all a £ Ag. Advers- 
ary has the following winning strategy. If the play history ends in a then since 
out-degree(a) > 1, Adversary moves to s', where s' ^ s and (a, s') G E. This 
contradicts the assumption of lemma that we have an update network. □ 

For any Survivor vertex s define Forced(s) = {a | out-degree(a) = 1 and 
(a,s) G E}, which denotes the set of Adversary vertices that are ‘forced’ to 
move to s. 

Lemma 2. If B is a bipartite update network such that [S'! > 1 then for every 
s € S there exists an s' ^ s and an a £ Forced(s), such that (s', a, s) is a directed 
path. 
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Proof. If B has more than one vertex in S then there must be a strategy for 
Survivor to create a play history to visit vertex s from some other vertex s' G S. 
To do this we need a forced Adversary vertex a (of A) in the neighborhood of 
s' . There exists such a vertex a by Lemma 1 . □ 

Definition 3 . Given a bipartite graph {S \J A, E) a forced cycle is a (simple) 
cycle (ofc, Sk, ■ ■ ■ , 02, S2, ai, si) for Ui € Forced(si) and Si G S. 

The following lemma is the main ingredient in characterizing bipartite update 
networks. 

Lemma 3 . If B is a bipartite update network such that [S'] > 1 then there exists 
a forced cycle of length at least 

Proof. Take si G S. From Lemma 2 there exists a path (s2,ai,si) in B such 
that S2 yf Si and oi G Forced(si). Now for S2 we apply the lemma again to get 
a path (53,02,52) in B such that S3 yf S2 and 02 G Forced(s2). If S3 = si we 
are done. Otherwise repeat Lemma 2 for vertex S3. If S4 G {si, S2} we are done. 
Otherwise repeat the lemma for 54. Eventually Sj G {si, S2, . . . , 5^-2} since B is 
finite. □ 

Note if B does not have a forced cycle of length at least 4 then either [S'! = 1 
or B is not a bipartite update network. That is, if [S'] > 1 then Adversary has a 
strategy to not visit a vertex S2 of S whenever the play begins at some different 
vertex si of S. 

We now define a contraction operator for reducing the size of an update 
game. Let B = {S U A, E) he a, bipartite update game with a forced cycle 
C = (ttfe, Sfc, . . . , tt2, S2, oi, si) of length at least 4 . The contraction operator 
applied to B via C, denoted B/C, produces an update game B' = {S' U A' , E') 
with |S"| < jS”!. We construct B' as follows. For new vertices a and s, 

S" = ( 5 '\ {si,S2, . . . ,Sfc}) U {s} and A' = (A \ {m, 02, . . . , Ofc}) U {a} 

and 

E' = E{B \ {si, oi, . . . , Sfc, Ofc}) U 

{(s, a') I a' G A' and (sj, a') G E, for some i < /c} U 

{{a', s) I a' G A' and (o', Sj) G E, for some i < k} A 

{(s', a) I s' G S' and (s', Ui) G E, for some i < k}U {(a, s), (s, a)}. 

We now present a method that helps us decide if a bipartite game is a bipar- 

tite update network. 

Lemma 4 . If B = {S U A, E) is a bipartite update game with a forced cycle C 
of length at least ) then B is a bipartite update network if and only if B' = B/C 



IS one. 
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Proof (sketch). We show that if B' is an update network then B is also an update 
network. We first define the natural mapping p from vertices of B onto vertices 
of B' by 

p{v) = V if V ^ C 
p{v) = a ifuGC'nS' 
p{v) = s if V € C n A. 

Then any play history of B is mapped, via the function p{v) = v', onto a play 
history of B' . Consider a play history vq,Vi, . . . ,Vn of B that starts at vertex ug. 
Let /' be a winning strategy for Survivor when the game begins at vertex Vq. We 
use the mapping p to construct Survivor’s strategy / in game B by considering 
the two cases = s and ^ s. It is not hard to construct a winning strategy 
/ for Survivor in game B whenever /' is a winning strategy in B' . 

By a similar case study one can show that if B is an update network then 
B' is also an update network (see [1] for details). □ 

With respect to the above proof, Figure 4 shows how a forced cycle of B is 
reduced to a smaller forced cycle (of length 2) in B' . 




Fig. 4. Showing the bipartite update game reduction of Lemma 4. 



Theorem 1. There exists an algorithm that decides whether a bipartite update 
game B is a bipartite update network in time 0{n ■ m), where n and m are the 
order and size of the underlying graph. 

Proof. We show that finding a cycle that is guaranteed to exist by Lemma 3 
takes time at most 0(m) and that producing B' from B in Lemma 4 takes time 
at most 0(n + m). Since we need to recursively do this at most n times the 
overall running time is shown to be 0{n ■ m). 

The algorithm terminates whenever a forced cycle of length at least four is 
not found. It decides whether the current bipartite graph is a update network 
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by simply checking that S = {s} and out-degree(s) = |A|. That is, the singleton 
Survivor vertex is connected to all Adversary vertices. 

Let us analyze the running time for finding a forced cycle C. Recall the 
algorithm begins at any vertex si and finds an in-neighbor oi (of si) of out- 
degree 1 with (s2,ai) € E where S2 ^ si- This takes time proportional to the 
number of edges incident to si to find such a vertex ai. Repeating with S2 we 
find an 02 in time proportional to the number of edges into S2, etc. We keep a 
boolean array to indicate which Si are in the partially constructed forced path 
(i.e., the look-up time will be constant time to detect a forced cycle of length at 
least 4) . The total number of steps to find the cycle is at most a constant factor 
time the number of edges in the graph. 

Finally, we can observe that building B' from B and C of Lemma 4 runs in 
linear time by the definition of S' , A' and E' . Note that if the data structure for 
graphs is taken to be adjacency lists then E' is constructed by copying the lists 
of E and replacing one or more vertices Si's or a^’s with one s or a, respectively. 

□ 

The above result indicates the structure of bipartite update networks. These 
are basically connected forced cycles, with possibly other legal moves for some 
of Survivor and Adversary vertices. 



4 Level Structures of Update Networks 

In this section we introduce a hierarchy of update networks which can be used 
to extract ‘efficient’ winning strategies for Survivor. The class EIq contains all 
update networks with exactly one Survivor node. Assume that the class Elk has 
been defined. Then an update network G belongs to Hk+i if and only if G can be 
contracted (via some forced cycle) to a G' such that G' £ Hk- Thus, by Lemma 4, 
G is an update network if and only if there exists a sequence G = Gq, Gi, . . . , G^ 
of update networks and forced cycles Gq, Gi, . . . Gk-i such G^ G Hq and Gi is 
obtained by the contraction operator applied to Gi-i via Gi. 

Definition 4. The level number k of an update network G is the minimum 
number of contractions of forced cycles needed to produce an update network 
with one Survivor node, that is G £ Hk and G ^ Hk-i- 

Intuitively, the level number represents a complexity of update networks. 
Thus, given an update network it is quite natural to compute the level number. 
Computing the level number, in turn, is related to finding a large forced cycles 
in the network. One approach for estimating this level number can be based on a 
greedy algorithm that repeatedly finds a largest forced cycle and applies the con- 
traction operator. However the following proposition shows that this approach 
is not efficient. 

Proposition 2. To find the longest forced cycle in an update game is NP- 
complete. 
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Proof. Let LongForced be the decision problem to decide if an update game 
has a forced cycle of length at least k. We know that deciding, for input a digraph 
G and integer k, whether G has a cycle of length at least k, called LongCycle, 
is NP-complete. We can polynomial-time many-one reduce the input (G, k) of 
LongCycle to an input (G', 2k) for LongForced as follows. Let G' be G with 
every edge subdivided, that is for every edge (u,v) in G is replaced with edges 
(u,a) and (a,v), where a is a new vertex. Clearly G' is a bipartite graph where 
the new vertices correspond to the Adversary set A. If G' has a forced cycle of 
length 2k then G has a cycle of length at least k. Obviously, the other direction 
holds. □ 

It turns out that, in general, the greedy algorithm does not compute the level 
number of update networks, as illustrated with the networks shown in Figure 5. 
Here, the graph has three forced cycles, one small one of length 4 and two large 
ones of length at least 5. The greedy algorithm would use 4 contractions, while 
the level number is actually 2. 




Fig. 5. Why the greedy algorithm does not find the smallest level number. 



5 Simple Update Strategies 

We now want to consider how to design Survivor winning strategies when we 
have an update network. 

From the previous section we see that one way for constructing winning 
strategies is to explicitly apply the contraction operation repeatedly while re- 
membering the sequence of forced cycles. From Lemma 4 a sequence G, Gi, . . . , 
Gfe of update networks is obtained from the original network G. Here the final 
network Gk with one survival node has a trivial winning strategy, where it mo- 
ves repeatedly in a cyclic fashion to all of its neighbors. The proof of the lemma 
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indicates how to compute a winning strategy for Gi from the winning strategy 
for Gi+i- (Recall, a forced cycle of length 2 in Gi+i is uncontracted to a forced 
cycle of length at least 4 in Gi.) We call strategies of this type contraction-based 
strategies. 

Example 4- Consider the update network G' of Figure 4. If we apply the contrac- 
tion algorithm again on the forced cycle ( 03 , 53 , 0 , 3 ) we get an update network 
with one Survivor node s' that has a winning strategy to alternate sending to the 
Adversary nodes a' and 04. Doing an uncontraction yields a winning strategy 
for G' where node s alternates sending to 03 and 04 and node S3 always sends 
to a. 

It turns out there is another simple local winning strategy (which may not be 
as efficient at updating all the nodes) that only requires each Survivor node to 
systematically update their neighbors. This strategy needs only to follow simple 
local rules with out any knowledge of the topology of the network. We now 
formally present this cyclic neighbor strategy. Each Survivor node s keeps an 
index is between 0 and ds=out-degree(t 6 )-l, initially set to 0. Survivor’s next 
move from node s is to its tg-th out-neighbor and ig is incremented modulo dg. 
We now show this is a winning strategy. 

Theorem 2. The cyclic neighbor strategy is a winning strategy for any update 
network. 

Proof. Consider any play consistent with the cyclic neighbor strategy for an 
update network G. There will be a set of persistent vertices called P.li P = AuS” 
then we are done. Now assume otherwise and consider the set N = {AU S)\P. 
There is at least one Adversary node in N by Lemma 1 (an adversary vertex 
of out-degree 1 can not be in P if it’s out-neighbor is in N). If we follow the 
cyclic neighbor strategy there must not be any edges from P (1 S to N (1 A, since 
all vertices in P were reached infinitely often. Thus Adversary has a winning 
strategy to keep out of N forever (no matter what strategy Survivor uses). This 
contradicts the fact that G was an update network. □ 

The above proof leads to the following more general winning strategy. 

Corollary 1. Any Survivor strategy for an update network that from any s € S 
moves to each neighbor of s infinitely often is a winning strategy. 

One can compare the cyclic neighbor strategies with the contraction-based 
strategies. In general the contraction-based strategies are more efficient at upda- 
ting all nodes of a network more frequently. However, there is a quadratic cost 
of having to compute these. However, like the cyclic neighbor strategies, there 
is very little operational cost in using a contraction-based strategy. The only 
requirement is that a sequence of neighbors (with possible repetitions) has to be 
kept at each node instead of just a single counter. 

We end this section with a couple comments about Adversary winning stra- 
tegies for update games that are not update networks. The decision algorithm 
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fails because there are no forced cycles of length at least 4 when there remains 
at least two Survivor nodes. In any case, Adversary knows which Survivor nodes 
to avoid. This strategy is a counter strategy to Survivor’s contraction-based 
strategy mentioned above. 

6 Conclusion 

In this paper we have presented a game-theoretic model for studying update 
networks. We have shown that it is algorithmically feasible to recognize update 
networks. That is, we have provided an algorithm which solves the update game 
problem in 0{n ■ m) time. Moreover, our algorithm for the case of bipartite 
update games is used to give a characterization of bipartite update networks. 
We have presented two simple routing strategies for updating these networks. 
We have also discussed the complexity of update networks in terms of the level 
number invariant. In general, update networks with the lower level numbers have 
simpler and more efficient updating strategies. 
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Abstract. We consider the following problem: Each processor of the 
network has assigned a (not necessarily unique) input value. Determine 
multiplicity of each input value. Solving this problem means any input- 
symmetric function (i.e. function not sensitive to permutations of its 
input values) can be computed. We consider anonymous synchronous 
networks of arbitrary topology, in which dynamic link faults [3,6] may 
occur. 

An instance of this problem has been stated as an open problem by N. 
Santoro at SIROCCO’98: Is it possible to distributively compute parity 
function (XOR) on anonymous hypercubes with dynamic faults? 

We show that if the network size N (the number of processors) is known, 
the multiplicity of inputs (and thus any input-symmetric function) can 
be computed on any connected network. The time complexity depends 
on the details of the model and the amount of topological information, 
but it is always a low polynomial in N. 



1 Introduction 

The problem of fault tolerance is very important in the field of distributed com- 
puting. Indeed, as more and more processors are being interconnected, the pro- 
bability of faulty links or nodes increases. According to classical model of fault 
tolerance - static model ~ the fault status of any component is fixed during the 
whole computation. Results based on this model cover fault tolerant distributed 
agreement as well as broadcasting, gossiping and routing. Consult a survey by 
Pelc [5] for more details. 

The communication model we consider is synchronous point-to-point mes- 
sage passing with dynamic faults, introduced in [3,6]. The computation network 
is represented by simple non-directed graph G = (V,E), where V and E repre- 
sent processors and bidirectional communication links, respectively. In the rest 
of the paper we use N for the number of processors. The links at a processor v 
are labeled 1,2,..., deg^, where degy is the degree of v. These labels are assigned 
in an arbitrary way, with no relevance to the structure of the network. 

The network is synchronous, in the sense that the communication and com- 
putation steps alternate. Each communication step takes 1 time unit, the time for 
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local computation is negligible. At one communication step, a processor can si- 
multaneously communicate with all its neighbours. This communication scheme 
is called shouting or all port model. We allow messages of arbitrary size. This 
unrealistic assumption will be discussed later on. 

Only link failures are considered. At each step at most k links may be faulty. 
The nature of the faults is dynamic - the set of faulty links may vary from 
step to step. All faults are of the bidirectional crash type, i.e. messages sent 
in any direction over a faulty link are lost. We suppose k is smaller than the 
edge connectivity of the network, otherwise it would not be possible to perform 
broadcast, neither to compute any global function. 

The network is anonymous, all processors are identical, equipped with the 
same algorithm. Computation is started simultaneously by all processors and 
terminates when all of them reach the termination state. The time complexity 
is expressed as the total number of communication steps. 

The most investigated problem in this model is the time complexity of bro- 
adcasting, studied on special topologies [2,3,4] as well as on general networks 
[1]. In fact, the broadcasting algorithm is always the same: Shout all you know 
to all your neighbours. The intricacy lies in showing how much time it takes to 
reach everyone when the adversary dynamically chooses the links to fail. Clearly, 
N— 1 steps are enough - at each step at least one new processor is informed. This 
bound can be improved to 0{N/k) [1], where k is the number of links that may 
fail at one step. Better results can be obtained for special topologies, sometimes 
almost matching the broadcasting time in the absence of faults (e.g. diameter-|-2 
for hypercubes [2], see also [4]). 

The problem of distributed evaluation of an Wary function / can be sta- 
ted as follows. At the beginning of the computation each processor is given an 
input value from a set of input values I, upon termination each processor should 
know the result of the application of / to the whole input. Since the network is 
anonymous and the output value is unique, / should be invariant to the permu- 
tations of input values imposed by the symmetries of the underlying network. 
As we consider general graphs, for the rest of the paper we restrict ourselves 
only to functions which are invariant to all permutations of inputs. (Boolean 
AND/OR/XOR and the sum of input values are examples of such functions.) 

The problem has been studied by Santoro and Widmayer in [6] for non- 
anonymous complete graphs, with several fault modes (crash, corrupting/ adding 
messages) being considered. 

If the network size N is known to processors, a broadcasting algorithm can 
be used to compute the boolean AND/OR of the inputs [6]: Each processor 
broadcasts its value and collects the values it sees. After iV — 1 steps it has 
received all values and can output their boolean AND resp. OR. This approach 
works well for AND/OR, because these functions depend only on the values 
present in the input, not on their multiplicity. It is not immediately clear how to 
compute more input sensitive functions, e.g. parity function (XOR) or sum of the 
input values, in the presence of dynamic faults. In fact, at SIROCCO’98 rump 
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session N. Santoro stated as an open problem even more restricted question: Is 
it possible to compute the XOR on anonymous hypercubes with dynamic faults? 

In this paper we present a deterministic algorithm InputMulti, which, given 
the network size N, computes the multiplicity of each input value in the pre- 
sence of dynamic faults. This means that any input-symmetric function can be 
computed, giving a positive answer to the open problem mentioned above. The 
knowledge of N is necessary. It is known that in general it is not possible to 
compute the network size on anonymous networks [7] deterministically. Howe- 
ver, computing the multiplicity of inputs implies knowledge of the network size 
(consider all input values being equal). 

Let B be an upper bound on time needed for broadcasting in a given network 
in the presence of dynamic faults, known to all processors. (Clearly B < N — 1, 
however, it can be much less if some topological information is given to proces- 
sors. For example if processors know that the communication topology is hyper- 
cube network, they can work with B = log N + 2.) Then the time complexity of 
the algorithm InputMulti is 2BN. 

Some of the model requirements are unnecessarily strong, we show how to 
adapt InputMulti to the following relaxed variants of the model in section 4: 

— Asynchronous start up: The computation is started by some non-empty 
set of initiators, instead of all processors. The cost for initial synchronization 
is additional B steps. 

— Unidirectional faults: The fact that a link always fails in both directions 
allows us to modify any algorithm in such a way that sender can detect 
failure of his message transmission: Add dummy messages, if necessary, to 
make sure that at each step, a message is sent via each link in both directions. 
Then, the fact that no message has been received on link h indicates that 
the message sent via h has been lost too. Unless stated otherwise, from now 
on we assume that this technique is implicitly used and sender can detect 
failure of his message transmission. 

If a link may fail in one direction only, sender can not straightforwardly 
detect call failure. The algorithm InputMulti assumes bidirectional faults, 
however this assumption is not essential. We present a work around this 
problem in cost of 2fc -I- 3 multiplicative factor. 

~ Short messages: Transmission of messages of arbitrary size in one time step 
is unrealistic. InputMulti can be adapted to the case when only messages of 
limited size (e.g. O(logA)) are allowed, with a slowdown factor of a low 
polynomial in N (depending on the details of the model) . 

The paper is organized as follows. In section 2 we present the algorithm 
InputMulti for computing the multiplicity of input values in the presence of 
dynamic faults and prove its correctness and time complexity. Section 3 contains 
a technique which, for some symmetric topologies, allows further reduction of 
the time complexity. Application to hypercubes yields an 0(log^ N) solution. In 
section 4 adaptations of the algorithm InputMulti to above mentioned relaxations 
of the model are presented. Finally, concluding remarks can be found in section 5. 




140 



S. Dobrev 



2 Computing the Multiplicity of Input Values 

The algorithm InputMulti works in stages. The first stage is started by all proces- 
sors. During each stage an asymmetry of the network, input data or occurrence 
of faults is used to reduce the number of initiators of the next stage. If no asym- 
metry is detected, the symmetry of the configuration allows computation of the 
result. In fact, InputMulti is a leader election algorithm, which either finishes 
prematurely by being able to compute the result, or elects a leader. If a leader is 
elected, computing the result is straightforward, since the network is no longer 
anonymous. 

Each stage consists of a building phase, a broadcasting phase and a resolving 
step at the end of the stage. Since the network is synchronous, all processors 
start computation synchronously and they know B, each processor knows when 
each phase/stage begins and when it ends. 

During the building phase flooding of the network from initiators is performed 
and each vertex is assigned a (not necessarily unique) name. In the course of this 
phase several broadcast processes are launched to inform other processors of 
what has been built. The building phase takes B steps - enough for the flooding 
to reach the whole network. The broadcasting phase takes additional B steps 
and it allows the broadcasts initiated during the building phase to reach all 
processors. This ensures that at the time of the resolving step all processors 
have the same information about the building phase. In the resolving step any 
asymmetry captured in this information is used to either identify the initiators 
of the next stage or compute the multiplicity of each input value. In the latter 
case the algorithm InputMulti announces result and terminates. 



2.1 Building and Broadcasting Phase 

During the building phase a forest of trees rooted at the stage initiators is built. 
The roots start with a name e, while the name of a son u of a processor v 
with name x is x o i^ where i is the label of the link (v,u) at v. The names 
are assigned by the flooding protocols initiated at roots: A processor v with an 
already assigned name x offers names x o 1, x o 2, . . . , a; o to its neighbours 
(neighbour to which link i leads to is offered the name x o i) . If a processor has 
not been assigned a name yet, it accepts one of the name offers it has received 
and proceeds by offering names to its neighbours. In addition, acceptance or 
rejection of each offer is announced by broadcasting alive/ dead messages. The 
idea is to have for each offer generated either an alive or a dead message, but 
not both (or neither) . The broadcasting phase is just a time buffer of B steps to 
guarantee all alive and dead messages arrive at all processors. During the both 
phases each processor collects all alive and dead messages it has seen during the 
current stage. 

A processor v has a local variable Namey containing its name, valid only for 
the current stage. The type of Namey is an ordered list of integers (e being the 
empty list), listox means adding element x to the end of the list list, valy is the 
input value of v and degy is its degree. 
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Broadcast (message) denotes a process which broadcasts message message to 
all other processors. 

KeepSending(( low are: x), link) is a process which sends message (low are: 
x) via the link link in each step until the end of the building phase. If during all 
these sending steps the link link was faulty, a message {dead, x) is broadcasted. 
(The offer never came to the receiver, act as if it was rejected.) 

The algorithm for an initiator v: 

Namey := e; 

Broadcast ( ( alive, e, valy, degy); 

KeepSending(( low are: i), i) for i = 1,2, . . . , degy-,{For all incident links.} 

At a vertex v upon receiving (low are: mi), . . ., (low are: mi) via links 
hi,h2,...,hi^: 

if V already has a name then 

Broadcast ((dead, m^)) for t = 1, 2, . . . 1; 

else 

NamCy := lexicographically minimal value mj among m^; 

Broadcast (( alive. NamCy, valy, degy))-, {One offer is accepted,} 

if message (low are: NamCy) has been received via at least two links then 
Broadcast (( dead, NamCy))-, {others are rejected.} 

Broadcast ((dead, m^)) for all m^ such that NamCy mp, 

KeepSending(( low are: NamCy oi), i) for the links i over which you have 
not received (during the current stage) a low are: message; 



The main process, as well as the child processes KeepSending and Broadcast 
execute simultaneously and interfere only in an explicitly stated way. This is 
possible because the link contention is resolved using messages of unlimited size: 
If several messages are sent simultaneously over a link, they are packed into one 
big message which is upon arrival unpacked back into original messages. 

As the above algorithm is in fact extension of the shout-to-all broadcasting 
algorithm, each vertex computes its name in time at most B. 



2.2 Resolving Step 

The resolving step is performed at the time 2B since the start of the current stage 
- after the termination of the broadcasting phase. At that time all processors 

^ Only the first ( You are:) message on a given link (during this stage) is considered 
here, its snbseqnent arrivals (due to KeepSending()) are ignored. 
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have received the same set of alive/dead messages and the following processing 
is performed by each of them: 

1. If there is a name for which both alive and dead messages were received, the 
processors with lexicographically minimal name among such names become 
initiators of the next stage. 

2. Otherwise, only alive or only dead messages were received for each name. 
Consider now the names for which only alive messages were received. If 
there is a name for which different values of valy or degy were received, the 
processors with minimal such name and with lexically minimal pair (valy, 
degy) become initiators of the next stage. 

3. Otherwise identical alive messages were received for each name. In this case 
the multiplicity of value val is given by \val\ ■ N/\Name\, where Name is the 
set of all names for which an alive message was received during the current 
stage and \val\ is the number of different alive messages in which val occurs. 

As this processing is performed at each processor, after the resolving step each 
processor knows whether it is an initiator of the next phase, passive element, or 
the result is computed. Note that if the current stage had a single initiator, all 
the assigned names are different and the last case applies. 
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Example. An example execution of InputMulti on a 4-node ring is shown 
in Figure 1. Each processor is marked by its name and its input value in the 
brackets. The arrows represent the You are: messages, the messages broadcasted 
by each processor are shown above/below it. 

In the first stage all vertices are initiators, thus starting with name e. The 
leftmost figure depicts the situation at the end of the first stage. At the time 
of the resolving step each vertex has collected the following set of alive/ dead 
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messages: {{alive, e, 5, 2), {alive, e, 7, 2), {alive, e, 9, 2), {dead, l),{dead, 2)}. In 
this case the rule 2 applies: The processors with the name e and the input value 
5 become the initiators of the next stage. 

The situation at the end of the stage 2 is depicted in the middle figure. 
The set of collected alive/ dead messages is the union of the sets listed for each 
processor. Again, the rule 2. applies, this time for the processors with name 2. 

Finally, the situation at the end of the stage 3 is shown in the rightmost figure. 
(The top link has been blocked during the whole building phase.) Now rule 3 
applies and the multiplicity of input values is computed: Name = [e, 2, 21, 211}, 
N/\Name\ = 4/4 = 1, |5| = 2 (2, 21), |7| = 1 (e) and |9| = 1 (211). 

Note that if the input values of the two top processors were the same (e.g. 
7), the rule 3. would apply in the second stage. 

2.3 Complexity and Correctness 

For a given stage, denote by //{x) the number of processors with the name x. 
Note that #(cc o i) < //{x), because a processor with the name x sends message 
{You are: i) to at most one of its neighbours. 

The analysis of the algorithm InputMulti is based on the following lemma: 

Lemma 1. Let the algorithm InputMulti reached stage r + 1, r > 0. Then the 
number of initiators of stage r + 1 is strictly less then the number of initiators 
of stage r. 

Proof. The initiators of stage r + 1 are determined at the resolving step of 
stage r by rules 1. and 2.. Suppose the initiators were determined by the rule 1. 
applied to name x. In that case both {alive, x, . . .) and {dead, x) messages were 
generated. Clearly x yf e, because no dead message for e can be generated. Let 
X = y o i for some y and i. {alive, x, . . . ) message means there is a processor v 
with name y whose offer ( You are: y oi) has been accepted, while the {dead, x) 
means there is a processor u with the name y whose offer ( You are: y o i) has 
been rejected or never delivered. It follows: The number of initiators of stage 
r + 1 = #(x) < #(y) < #(e) = the number of initiators of stage r. 

Consider now the case when the initiators were determined by the rule 2. 
applied to name x. Since #(x) < #(e) and a proper subset of processors with 
the name x is chosen as the set of initiators for the next stage, the lemma holds 
also in this case. □ 

Since the original number of initiators is N, InputMulti terminates at last at 
the end of a stage with single initiator, and each stage takes 2B steps, we get 
the theorem: 

Theorem 1. The algorithm InputMulti terminates in time at most 2BN . 

The following theorem completes the analysis of the algorithm InputMulti: 

Theorem 2. The algorithm InputMulti correctly computes the multiplicity of 
each input value. 
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Proof. Since the values at processors with the same name are equal (otherwise 
rule 2. would apply), the multiplicity of the value val is given by a ^ over 
names x, for which the processors with name x have input value val. If we show 
\fx,y € Name: ff{x) = #(y), this sum can be simplified to |uaZ| Moreover, 
in such a case #(x) can be computed as N/\Name\. The resulting expression for 
multiplicity of the value val is \val\ ■ N /\Name\, which is exactly the term used 
in the resolving phase. 

Vx,?/ € Name : ff{x) = ff{y) is equivalent to Vx G Name : #(x) = #(e). 
We prove this assertion by induction on the length of x. The first step is trivial: 
#(e) = #(e). The induction step is Va;, xoi & Name, ff{x o i) = ff{x). Since the 
inequality ff{xoi) < ff{x) has already been shown and the algorithm terminates 
only when no dead message was received for all a; o i g Name, the only case left 
for discussion is whether the inequality ff(xoi) < ff{x) might be proper. The 
negative answer would finish the proof. 

Let us assume ff{xoi) < ff{x) without generating message {dead, xoi). That 
means all xoi offers were accepted. Since ff{xoi) < ff{x), there was a processor 
V with name x, which did not offer name a; o z to any of its neighbours. That 
means deg^ < i. However, x o i £ Name implies the existence of a processors 
v' with the name x and degree at least i. Since the algorithm terminates only 
if all processors with the same name have the same value and degree, we get 
a contradiction. □ 



3 Special Topologies 



The previous result shows that the multiplicity of input values can be computed 
within the time 2BN. The factor N (which on many topologies (e.g. hypercubes, 
star graphs and tori) is significantly higher than B) is determined by the number 
of initiators in the first stage. For a given graph G, let S be the time needed to 
compute the multiplicity of input values in the absence of faults. The following 
preprocessing reduces the number of initiators to 2k (recall that k is the maximal 
number of faults in one step): 

— Run the computation of multiplicity of input values assuming no faults occur. 

— Run concurrently the following fault detection algorithm for S steps: At 
every step each processor sends dummy message to all its neighbours and 
tests whether it has received messages on all its links. Missing message indi- 
cates a fault. When a processor detects a fault, it broadcasts the time when 
the fault has occurred. 

— Wait B steps - the time sufficient to finish broadcasts. If no fault has been 
detected, the multiplicities were correctly computed. Otherwise, the proces- 
sors with minimal time of an incident fault are initiators of the first stage. 
Obviously, there are at most 2k initiators (when all first faults were duplex 
and vertex-disjoint). 
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Substituting 2k for N (the number of stages) in Theorem 1 and taking into 
account the cost S' + i? of this preprocessing we get: 

Theorem 3. Let S he the time needed to compute the multiplicity of each input 
value on G in the absence of faults. Then the multiplicity of input values can he 
computed in the presence of dynamic faults in time S + (4fc + 1)B. 

The following lemma allows the Theorem 3 to be successfully applied to many 
frequently used symmetric topologies (e.g. hypercubes, rings and cliques). The 
main idea is to broadcast all values and to use the number of paths of a given 
length between processors, the multiplicity of received messages for a value x 
and the time of the arrival of these messages, to compute the multiplicity of x. 

Lemma 2. Let G be a topology for which the number of different paths P\ of 
length I between two vertices of distance d is the same for all such pairs of vertices 
and for all d and 1. Suppose this topology is known to all processors. Then in 
the absence of faults the multiplicity of input values can be computed in time D, 
where D is the diameter ofG. 

Due to space limitations the proof (as well as the proofs in the remainder of this 
paper) is left to the reader. 

Lemma 2 in combination with Theorem 3 can be applied to hypercubes. Since 
for hypercubes D — log N, k = log N — 1 and B = log + 2, we get: 

Corollary 1. O(log^iV) steps are sufficient to compute the multiplicity of input 
values on an anonymous D-dimensional hypercube with dynamic faults. 



4 Relaxing the Model 

4.1 Asynchronous Start Up 

In the previous discussion all processors have been supposed to start the execu- 
tion of the algorithm simultaneously. If that is not the case, B steps are sufficient 
to synchronize them using the following protocol. 

— Upon spontaneous wake up: Send clock value 1 to all neighbours inclu- 
ding yourself. 

— Upon receiving messages (ti) for i = 1, . . . ,1: Set your local clock to the 
maximal value received. If it is equal to B, terminate. Otherwise send your 
local clock-|-l to all neighbours including yourself. 

Since the maximal clock value grows each step by 1 and broadcasting takes 
at most B steps, each processor receives message containing B at the time B 
after the wake up of the first processor and no such message was received before. 
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4.2 Unidirectional Faults 

Up to now, the sender was supposed to detect failure of its call using the fact 
that links always fail bidirectionally. This requirement seems to be quite strong. 
Fortunately, it is not necessary: The only situation in which bidirectionality of 
faults is used is when KeepSending detects that it failed to transmit You are: 
message during the whole building phase. In the case of such permanent faults 
the receiver knows that something is wrong - it has not received any message on 
the faulty link during this stage. What the receiver does not know is the name 
that has not come. So, it is the sender who should send the corresponding dead 
message and, for that reason, it needs the mentioned feedback. A possible way 
around is to make the receiver know the name: 

The simple sender(father) ~ receiver(possible son) communication from the 
Input Multi algorithm is replaced by a three step protocol (see Figure 2.): 

— Sender keeps trying to send receiver message {You are: y). 

— After receiving a ( You are: y) message, processor keeps trying to send reply 
{OK, y). 

— When a processor receives {OK, y) message, it keeps trying to send reply 
{Proceed, y). 



Simple Protocol 



Advanced Protocol 



time X (You are: x o i) x o i 

• • 

. ^ . 

• *• 



X (You are: x o i) x o i 







• X/ 


• 










(OK: X o i) 












(Proceed: x o i) 






• d\ 






Fig. 2. 



A processor computes its name based on Proceed messages in the same way 
as in the simple protocol from You are: messages. The dead and alive messages 
are generated in the same way, except for: 

— At the end of the building phase sender broadcasts {dead, y) message if it has 
not received corresponding {OK, y) message to his {Your are: y) message. 

~ At the end of the building phase receiver broadcasts {dead, y) message if it 
has not received corresponding {Proceed, y) message to his ( OK, y) message. 

Note, that although both - sender and receiver - might broadcast dead message, 
it is never the case that one of them broadcasts alive and the other dead message. 
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Thus, the crucial property which the resolving step and correctness arguments 
are based on is maintained. 

To make InputMulti work using this three-step protocol, enough time must 
be reserved for the building phase to let each processor compute its name. One 
may naively hope for a constant slowdown factor of 3 for the resulting algorithm, 
compared to the simple one. However, during 3 consecutive steps the faults can 
be adversely placed on up to 3fc links, blocking the completion of the protocol 
on all of them. Nevertheless, the time for the building phase can be reasonably 
bounded: 

Lemma 3. In the presence of unidirectional dynamic faults, each processor com- 
putes its name within {2k-\-3)B time steps since the start of the building phase. 

Since the broadcasting phase takes B steps also in the case of unidirectional 
faults, we get: 

Theorem 4. Let N be the network size, B be an upper bound on time needed 
for broadcasting in the presence of up to k dynamic faults and let B and k be 
known to all processors. Suppose that links may fail unidirectionally . Then the 
multiplicity of input values can be computed in time (2k-l4)BN. 

4.3 Short Messages 

Up to now we have supposed that messages of arbitrary size can be sent in one 
step. Straightforward simulation of long messages by a sequence of short ones is 
not possible, because it gives faults time to block much more than k messages. 
However, as the following theorem shows, a clever simulation is still possible. 

Let a unit message be a message that can be transmitted in one step, e.g. of 
size O(logfV). 

Theorem 5. Let s be an upper bound on size (in unit messages) of the maximal 
message an algorithm A uses on graph G and k be an upper bound on the number 
of faults in one step, both known to all processors. Let T be the time complexity 
of A on G. Then there exists a simulation A' of the algorithm A using only unit 
messages, with time complexity T(1 -|- 2(s — l)(fc -|- 1)). 

The idea of the proof is to split a long message into short blocks and try sequen- 
tially sending these blocks. The transmission of the next block will start only 
after receiving acknowledgement of arrival of the current one. 

Since the size of processor names constructed by InputMulti is at most 
0{B log N) bits and there are at most \E\ broadcasts of dead and alive mes- 
sages running concurrently, the maximal message size in unit message can be 
bounded by 0{N^). As both k and B are less then N, substituting T = 2BN in 
the Theorem 5 yields: 

Corollary 2. Let N be the network size, known to all processors. Then the 
multiplicity of input values can be computed in time 0{N^). 

For the case of hypercubes we have k,B G 0(logN), T S 0(log^ N) and 
s G 0{B\E\) = 0{Nlog^ N), which yields much better bound of 0{Tsk) = 
0(log^ N ■ Nlog^ N ■ logiV) = 0(iVlog® N). 
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5 Conclusions 

We have shown that the multiplicity of input values, and thus any function inva- 
riant to permutations of input values, can be computed on anonymous networks 
with dynamic faults. The time required is small polynomial in the network size 
N and can be further reduced by additional topological information. The algo- 
rithms presented here were designed with simplicity in mind, there is a room for 
further improvements: 

~ The resolving step can probably be improved to significantly reduce the 
number of initiators, eventually reducing the number of stages from N to 
log N. Although the multiplicity of receiving a given alive message is not 
reliable and can be altered by link faults, it still gives some estimate of 
relative density of the processors with the corresponding name. 

— The {2k + 3) estimate for the slowdown of the InputMulti using the tree-step 
protocol, compared to the simple one, is rather rough. It might be possible 
to improve it, although a different technique is needed. The same applies to 
the simulation of long messages by short ones. 

~ The reduction of size and number of messages would be interesting, with 
most significant implications in the model allowing only short messages. 

Nothing is known for the case of more malicious faults (corrupting/ adding mes- 
sages). 
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Abstract. Diameter of the 2*^-node Knodel graph is [(fc + 2)/2]. 



1 Introduction 

Recently, diverse properties and invariants of interconnection networks, not 
only those of parallel machines, have been studied and a number of interesting 
results has been found, see e.g. [1]. One of the important features of an inter- 
connection network is its message passing ability. The quality of an interconnec- 
tion network depends mainly on the time delay of the communication between 
the nodes, which can be either processors or computers or other type of terminals. 
Let us suppose that a node knows a piece of information and needs to transmit 
it to every other node in the network. This task is usually called broadcasting. 
When each node knows a piece of information that has to be transmitted to every 
other node, we are speaking about gossiping. We refer to [6,8,9] for surveys on 
broadcasting and gossiping. The Knodel graph was introduced 25 years ago by 
Knodel [11], as an interconnection network where gossiping can be performed in 
the minimum time if the 1-port, telephone model is supposed. More precisely, 
Knodel gave an algorithm in a complete graph on even number of nodes, so that 
it allows to complete gossiping in the minimum possible time and Fraigniaud 
and Peters [7] formally defined the Knodel graph, which is the graph underlying 
the Knodel’s communication pattern. 

The Knodel graph on 2^ nodes is of special interest. It contains the minimum 
possible number of edges while the time needed to achieve gossiping is the same 
as it would be in the complete graph. In other words, this graph is a minimum 
gossip graph. It can also be shown that the Knodel graph on 2^ nodes is a 
minimum broadcast graph [12] and a minimum linear gossip graph [7]. In [5], it 
was shown that many of the graphs given as examples of minimum broadcast 
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graphs in [4,10] or of minimum gossip graphs [12], are in fact isomorphic to the 
Knodel graph. It should be noted that if the number of vertices is not a power 
of 2, then the Knodel graph need not be a minimum gossip or broadcast graph. 

Knodel graphs were also studied in [3] in connection with the load balancing 
problem in parallel systems. 

Let us define the Knodel graph in general: The Knodel graph on n > 4 nodes, 
n is even, denoted Wn, is composed of nodes (i,j) where i = 1, 2 and 0 < j < 
n/2— 1 and of edges between any two nodes: (l,j) and (2,j + 2^ — 1 (mod n/2)) 
for any j, 0 < j < n/2 — 1 and 1,0 < I < [log 2 nj — 1; the edges connecting the 
nodes: (1, j) and (2, j + 2^ — 1 (mod n/2)), for any j, 0 < j < n/2 — 1 are said 
to be in dimension 1. 

Diameter is one of the main invariants of an interconnection network giving 
the number of hops in a shortest path that a message has to perform on its way 
from the source to the destination node in the worst case. Although the Knodel 
graph was introduced 25 years ago, the diameter of the Knodel graph has not 
been determined yet. 

In this paper we prove: 

Theorem 1. For k >2 

diameter (IT 2 fc) = 

In the following section we introduce an alternative definition of the Knodel 
graph. In sections 3 and 4 we present the upper and lower bound parts of the 
proof of the Theorem, respectively. The last section is devoted to some open 
questions and conclusions. 

2 Alternative Definition 

We introduce an alternative and useful definition of the Knodel graph Wn, where 
n = 2^. Let us take the Hamiltonian cycle of the Knodel graph composed of 
edges of dimensions 0 and 1 and label the nodes of the Knodel graph as follows: 
(1, j) = 2j, (2, j) = 2j — 1, 0 < J < n/2 — 1. It means that a node of the Knodel 
graph, labelled by an even number x, is incident to the edges connecting it to 
the nodes: x + 2* — 3 (mod n), for each 1 < / < fc. Similarly, all neighbours of a 
node labelled by an odd number x have labels of the form x — (2* — 3) (mod n) . 
The edges corresponding to I are the edges of the {I — l)st dimension of the 
general definition of the Knodel graph. See Figure 1 with the Knodel graph Wiq. 

As the Knodel graph is vertex symmetric [5], to determine the diameter, it 
is sufficient to study the distance from the node 0 to any node x. If there is a 
path from the node 0 to a node x using / edges in dimensions: , 

then there exists a representation of x in the following form: 

/ 

x = ^(-l)*-i(2'‘+i -3) (modn). 

2=1 
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Fig. 1. Knodel graph Wie 



Conversely, any such representation of x corresponds to a path between 0 and x. 
Therefore, determining the shortest path between 0 and x is equivalent to finding 
the above representation of x with minimal possible /. The representation will 
be called irepresentation and we will say that the length of the irepresentation 
is /. Note that for even x the ±3 terms disappear from the above representation. 

3 Upper Bound 

In this section we show that the diameter of is at most \{k + 2) /2] . 

3.1 Even Case 

Let x be an even /c— bit number. Then 

m 

i=l 

such that k > ii > ji > Z 2 > j 2 > > *m > jm > 1, where each pair 2** — 2^* 

represents the block of ones in the binary representation of x i.e. 2*‘ — 2^* = 
2 *t-i _|_ _|_ 2 lt_ Knodel graph each pair 2** — 2-1* represents 

two edges: one edge from the it— th dimension and the other from the jt— th 
dimension which follows after the first one and the irepresentation gives a path 
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how the node x can be reached from the node 0. This representation will be 
called the basic irepresentation. 

Let I = {ii, Z 2 , tm}, J = {ji) J 2 j •••, Jm}- Let #(A) denote the cardinality 
of a set A. A maximal sequence of consecutive ones in the binary representation 
of X is called a block. A maximal set of consecutive numbers from lUJ is called 
a iblock. A number a such that 1 < a < A: and a ^ I U J we will call a gap or a 
gap number if more suitable. Notice that 0 ^ / U J and we will not count it to 
gap numbers. An important fact is that each iblock is accompanied with one 
gap number which is a number by one greater than the maximum of the block. 
The only exception is a (possible) block with k. 

Let be the set of iblocks of integers from / U </ such that the maximums 
of the iblocks are from / and the minimums of the iblocks are from J i.e. the 
length of a iblock from is even. Let B~ be the set of even length iblocks 
of integers from / U J with maximums from J. Similarly, we define the set of 
iblocks {B~) which is the set of the iblocks with maximums from I (J) 
and the lengths of the blocks are odd and strictly greater than 1. For the sets of 
blocks of length 1 we will use notation B^ and B^ . 

The sum of lengths of all iblocks in a set of iblocks B*, where p G {e, o}, and 
* G {+, — } will be called total length of the set and denoted \B*\. 

In what follows, we show that either the basic irepresentation of x satisfies 
#(/U J) < I" (k+2) /2] or we can transform the basic irepresentation into another 
irepresentation, where new I and J are sets s.t. #(/ U J) < \{k + 2)/2]. 

First we introduce four reduction rules and then we show some useful lemmas. 
Reduction rule Rl: 
if a yf 6 G / s.t. a — 1, 6 + 1 G J, exist then 
do / = / U {a — 1} — {a, 6}, J = J U {6} — {a — 1, 6 + 1} od 
Reduction rule R2: 

if a, 6, c G / are pairwise not equal s.t. a— 1,6— 1 ,c+2g J, a— lyfc+2yf6— 1, 
exist and c + 1 ^ / U J then 

do / = / U {a — 1, 6 — 1} — {a, 6, c}, J = J U {c, c + 1} — {a — 1, 6 — 1, c + 2} od 
The reduction rule R3 is symmetrical to the rule R2. 

Reduction rule R3: 

ifa,6,cG / are pairwise not equal s.t. a+ 1, 6+ 1, c— 2 G J,a+1 yf c— 2 yf 6+ 1, 
exist and c — 1 ^ / U J then 

do / = / U {c — 1, c — 2} — {a, 6, c}, J = J U {a, 6} — {a + 1, 6 + 1, c — 2} od 

Reduction rule R4: 

if /c, a G /, A: yf a, a — 1 G J then 

do / = / U {a — 1} — {A;, a}, J = J — {a — 1} 

The correctness of the rules follows from the equivalences: 

Rl : 2“ + 2** - 2““^ - 2^+1 = 2““^ - 2'’ 

R2 : 2“ + 2** + 2^= - 2““^ - 2^“^ - 2^=+^ = 2““^ + 2'’“^ - - 2° 

R3 : 2“ + 2** + 2^= - 2“+^ - 2^+^ - 2°“2 = 2'=“^ + 2°“2 _ 2“ - 2'’ 

R4 : 2^= + 2“ - 2“-i(mod 2'=) = 2“-h 

Therefore the rules R1-R4, if applied to a irepresentation of x, produce a new 
irepresentation of x. 
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Note that any application of the reduction rules R1 and R4 creates two new 
gap numbers a, 6 + 1 and k,a, respectively. Any application of the reduction rule 
R2 introduces three new gap numbers a, 6, c + 2 while the original gap number 
c + 1 is changed to a number from J i.e. by applying the rule R2 the number 
of gaps in the new irepresentation is increased by 2. Similar situation holds for 
the rule R3, where three new gap numbers a + 1,6 + l,c are created and the 
original gap number c — 1 is changed to the number from I. Again, the number 
of gap numbers in the new irepresentation is increased by 2. 

Now, we describe the application of the above rules on a basic representation. 
The following lemma shows a procedure how to get a new irepresentation where 
only one of the set of blocks B+, B~ , R+, B~ remains non empty and possibly 
B+ ^ 0, Rf ^ 0. 

Lemma 1. Let B^ yf 0 and B~ yf 0. We can reduce both sets of Lblocks to 
a new set of Lblocks of the same type as it was the type of the set of greater 
length. Moreover we get a set of Lblocks of length 1 each accompanied with a 
gap number. A similar proposition holds for all following combinations: B^ yf 0 
and R“ yf 0; R+ yf 0 and R+ yf 0; R+ yf 0 and R“ yf 0; R“ yf 0 and R+ yf 0; 
R“ yf 0 and B~ yf 0. 

Proof. Let Bf yf 0 and B~ yf 0. Let us apply the reduction rule Rl, as many 
times as possible so that we take a from a iblock of Bf and 6 from a iblock 
of B~ and a and 6 are the maximal not yet in Rl used numbers of I and 
J, respectively. We get new blocks of length 1. Furthermore, if in the original 
irepresentation \Bf\ = \B~\ then in the new irepresentation Bf = B~ = 0 
and if \Bf\ > \B~\ {\Bf\ < \B~\) we get a new irepresentation where Bf yf 0 
and B~ = 0 (R+ = 0 and B~ yf 0). There may be created at most one new 
±block which is of odd length, greater than 1. Let the maximum of the block 
be d. Then d,d — I G / {d,d — 1 G J if \Bf\ < \B~\) and the part of the 
iblock except d is a part of the original iblock from Bf . This new iblock is 
accompanied with two gap numbers - the gap number accompanying the original 
iblock and the gap number d + 1. Therefore, we can consider the new iblock to 
be an even length iblock with maximum d — 1 and accompanied with one gap 
number. Formally we can consider d as a iblock from B^ accompanied with a 
gap number too. □ 

One can see that the application of rule Rl affects each element of / U J 
of the basic irepresentation at most once and the gap numbers of the basic 
irepresentation are not affected at all. For the rest we need to process the still 
not affected part of the irepresentation. We discuss two cases R+ yf 0 and 
0 in the following two lemmata. 

Lemma 2. If in a ± representation Bf yf 0, Bf = B~ = B~ = 0 then we can 
create a new Lrepresentation where the number of gaps is at least #(JU J) — 3. 

Proof. Let R+ yf 0, Bf = Bf = Bf = 0. We can suppose that R+ does not 
contain any ±block of length 4t+ 1, t > 1. Otherwise, we can apply the reduction 
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rule R1 so that the a, a — l,b,b + 1 (see the rule Rl) are taken from the same 
±block of the length + 1. If we do it to maximal possible extent, we get for 
each such ibblock (except the one with k as & maximum) as many gaps as the 
number of elements in new / and J. We can also suppose that does not 
contain more than one ±block of length 3. Otherwise we can efficiently use the 
reduction Rl on the set of all the iblocks of length 3. If the number of the 
ffiblocks is odd, then one such iblock remains. 

It remains to analyse the case when all iblocks of are of length 4i + 3, i > 
1. On each iblock of B^ we can use the reduction rule Rl. We can do it in a way 
that after reduction we get for each such iblock u,u—l,u — i G I and u — 2 G J. 
With each such new iblock two gap numbers are accompanied: u+ 1 ^ ID J and 
the one which was accompanied with the original iblock (except the case when 
the maximum of the original iblock is equal to k.) If we get two such iblocks 
i.e. u,v exist such that: u,u—l,u — 3 G I,u — 2 G J,v,v — l,v — 3 G I,v — 2 G J (it 
holds: {u,u—l,u — 2,u — 3}r\{v,v—l,v — 2,v — 3} = 0), then we can apply the 
reduction rule Rl taking for example u—l,u — 2 from one iblock and v — 2,v — 3 
from the other iblock. Create as many as possible pairs of type u, v and apply 
Rl on all the pairs. After the latest reduction we get at least #(/ U J) — 1 gaps 
except in the case where the number of iblocks in B^ was odd. 

Now, let us analyze the odd case. Let k ^ I. By applying the same procedure, 
as it was used for the even case, we get #(/ UJ) — 2 gaps. Let fc G /. If fc G / and 
the number of iblocks is more than 1, then apply R4 so that a is a maximum 
of another iblock. Now we may have an even length iblock with maximum 
k—1 € J and one even length iblock of form: a — 1, a — 2, a — 3, a — 4, ...,v+ 1, v 
s.t. a — 1, a — 2, a — 4, ..., v € I, a — 3, a — 5 , ..., v + 1 € J and a + 1, a, u — 1 are 
gap numbers. If the length of iblock with maximum A: — 1 is shorter than the 
total length of the other iblocks— 1, then we can apply the reduction rule Rl 
so that in place of the iblock with maximum fc — 1 we will get as many gaps as 
the numbers from J, and this case is reduced to the previous one. We will get at 
least #(/ U J) — 2 gaps again. 

Let the length of the iblock with maximum fc — 1 be greater than the total 
length of the other iblocks— 1. Then we can use the reduction rule Rl again 
so that one couple (in the rule shown as b + l,b) is taken from the iblock 
with maximum k — 1 and the other one is taken from other iblocks. In the 
place of other iblocks we get the same number of gaps as #(/) and in the 
place of the original iblock with maximum A: — 1 we get a iblock of form 
u,u — l,u — 3, ..., V — 3,v — 1 G J, M — 2, M — 4, M — 6, ..., v — 2,v € I and we apply 
the rule Rl to it. 

Finally, two iblocks may rest: either d,d — 1 G J and g,g — 1 G I, then the 
number of gaps is at most by one less than \J J) or d,d— l,d — 3 G J and 
d—2,g,g+l G I and we have at least #(/ U J) — 3 gaps and we are done. Let the 
basic irepresentation contains only one ±block. Then the following reduction 
will take place: apply R4 so that a = k — 2 and then apply Rl to the rest of 
the iblock. In the worst case A: = 4z + 5 and there are 2i + 1 gaps and we are 
done. □ 
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Lemma 3. If in a ± representation = B^ =% then we can 

create a new ztrepresentation where the number of gaps is at least #(/U J) — 3. 

Proof. Let Bf ^ = Bf = B~ = 0. If /c G /, apply R4 so that a is a 

maximum of any other zhblock. We get an odd length zhblock B = k—1, k—2, ...,y 
s.t. k—1, k—3, k—5, y £ J and k—2, k—A, fc— 6, y+1 G / and one odd length 
±block of form: u — l,u — 2,u — 3,u — 4, ...,v + l,v s.t. u— l,u — 2,u — 4, ...,v G 
I,u — 3,u — 5, ...,v + 1 G J and y — l,u + l,u,v — 1 are gap numbers. Now we 
can use the rule R1 so that a = k — l,a— l = k — 2 and as 5, 6 + 1 can be taken 
from any other ±block - the only condition is that we take it from the shortest 
one and from its beginning. If the zhblock B is exhausted by the rule R1 or it 
was empty, we continue as it is in the case where k ^ If) J (see below) . 

If the length of the zhblock B is greater than the total length of the other 
iblocks (here we do not count the zhblocks of length 1) , then in the place of 
other zhblocks we get as many gaps as non gap numbers (we also count the 
original gap numbers). We can apply to the rest of the zhblock B the reduction 
rule R1 so that the 6 + 1,6 of the rule is taken from the beginning of the rest 
and a, a — 1 from the end of the rest of the zhblock B. Either we get a zhblock 
y, y — 1 G J or a zhblock y,y — l,y — 2,y — 3 where y,y — l,y — 3 £ J,y — 2 £ I. 
In the first case, there are as many gap numbers as non gap numbers in place of 
the original zhblock with the maximum k and we are done. In the second case, 
the number of gap numbers is smaller by two than the number of the non gap 
numbers in the place of the original zhblock with maximum k and we are done. 

Let k ^ I. Let us use the reduction rule R1 in such a way that a, a — 1 we 
take from the shorter zhblocks and 6 + 1, 6 i we take from the longer zhblocks. 

If one zhblock remains, then we apply the rule so that a, a — 1 is from the 
beginning of the zhblock and 5+1,6 from its end. In the worst case, a iblock 
remains composed of: a, a—1, a— 3 £ J and a— 2 G / such that a+1, a— 4 ^ /U J or 
a iblock composed of: a, a—1, a— 3 G I and a— 2 G J such that a+1, a— 4 ^ lU J. 
As there are at least #(/ U J) — 3 gaps, we are done. 

If two same length iblocks remain, then we apply the reduction rule R1 at 
the maximal possible extent (if the length is 2, we do not apply it) so that we 
take a, a— 1 from the first of the blocks and 6+ 1, 6 from the second of the blocks. 
In this case there are at least ff{I U J) — 3 gaps in the new irepresentation. 

If three same length iblocks remain, then we apply R2, if possible, so that 
we take c from the end of one of the iblocks and then we apply R1 similarly 
as in the previous case. If rule R2 is impossible to apply then there are at 
least 2 gaps behind each of at least two of these iblocks (it means that behind 
these two iblocks are in the basic irepresentation iblocks which are associated 
with 2 gaps - not only 1). By application of R1 at maximal possible extent we 
get (one should consider the above mentioned 2 superficial gaps from the basic 
irepresentation) a irepresentation with at least #(/ U J) gaps. 

If four iblocks remain, we can apply R2, if possible, and R1 in the similar 
way as in the case of three iblocks or we can argue similarly if R2 can not 
be used so that for these four iblocks the number of gaps is the same as the 
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numbers in / U J. It means that in the new irepresentation we have at least 
#(/ U J) - 1 gaps. 

Finally, if 4j + 0 < t < 3, iblocks remain, we can create a new 

±representation with at least #(/ U J) — 3 gaps so that we apply R2 and R1 j 
times in the way as for four blocks and then for the rest i blocks we apply the 
previous cases accordingly. □ 

The cases: B~ ^ 0, B+ = B~ = B+ = % and B~ ^ 0, B+ = B+ = B~ ^ % 
are symmetrical (we have to use R3 rule in place of R2) to the cases discussed 
in the Lemma 2 and in the Lemma 3, respectively. 

In the resulting irepresentation, if A: = 2/ then U J) < ^ + 1 and if 
k = 21 + 1 then #(/ U J) < / + 2. In both cases we have #(/ U J) < [(fc + 2)/2] . 

3.2 Odd Case 

Let X be an odd k-hit number. The basic irepresentation of x is created as 
follows: 

Let the three least significant bits of the binary representation of x be 001. 
Then the basic ± representation of x is defined as follows: 

m 

x = ^(2*‘ -2^*) +2^-3, 

t=i 

where k > ii > ji > %2 > J 2 > > jm > 3. We determine a new 

irepresentation so that we use the same procedure as in the case of x even 
to the k — 2 most significant bits of the binary representation of x. Such a 
irepresentation is at most \k/2'\ long. This irepresentation completed by 2^ — 3 
is a new Trepresentation of x s.t. its length is at most |"A:/2] + 1 = |"(fc + 2)/2]. 

Let the three least significant bits of the binary representation of x be 101. 
Then the basic irepresention of x is as follows: 

m 

x = ^(2*‘ -2^*) +2^-3, 

t=i 

where A: > Zi > ji > i 2 > J 2 > > jm > 4. According to the pre- 

vious subsection we can find a irepresentation of the k — 3 most significant 
bits of the binary representation of x s.t. its length is at most [(A: — l)/2]. This 
irepresentation together with 2^ — 3 is a new irepresentation of x s.t. its length 
is at most [(A: — l)/2] -|- 1 < [(A: -I- 2) /2] as k is odd. 

If the three least significant bits are Oil or 111 then 

m 

X = ^(2** — 2^*) + 2^ — 3(mod n), 

t=i 

such that k > j\ > i\ > j 2 > > ■■■ > jm > im > 2. Observe that 2^ — 

3(mod n) = 2^ — 1 sets ones in all bits of the binary representation of x and 2*‘ — 
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2-^‘, for each 1 < f < m, represents a block of zeroes in the binary representation 
of X. 

First, assume that the three least significant bits are 111. A irepresentation 
of zeroes in the k — 2 most significant bits is symmetrical to a irepresentation 
of ones in a /c — 2— bit even number. As such a irepresentation of length at most 
|"fc/2] exists, there also exists a irepresentation of the zeroes in the k — 2 most 
significant bits of the same length. This irepresentation together with 2^—3 
taken mod n is at most of length \k/2~\ + 1 = |"(fc + 2)/2] . 

Now, assume that the three least significant bits are Oil. If the most signi- 
ficant (fc — 1— st) bit of X is 1, then our problem is similar to the previous one: 
a Arepresentation of the zeroes in a fc — 2— bit odd number is symmetrical to a 
irepresentation of the ones in a fc — 2— bit even number. This Arepresentation 
together with 2^—3 taken mod n is at most of length |"fc/2] Al = |"(fcA2)/2]. 

If the most significant bit of x is 0, then the problem of representation of the 
fc — 1 most significant bits is symmetrical to the problem of representation of a 
fc — 1 bit even number with ones in the most significant bit and in the second 
least significant bit. In this case we can find a Arepresentation (see Lemmas 2 
and 3, parts of the proofs with application of the rule R4) where the number of 
gaps is at least equal to #(/U J) — 2. The symmetrical Arepresentation together 
with 2^ — 3 taken mod n is of length at most |"fc/2] A 1 = [(fc A 2)/2] . 



4 Lower Bound 

In this section we present the lower bound part of the proof giving examples of 
vertices whose distance from 0 is at least [(fc A 2)/2j . 

Let 0 < x < 2^ be an integer. Recall that the sequence of consecutive ones 
in the binary representation of x is called a block. 

Lemma 4. The number of blocks in a; A 2* (mod 2^), 0 < i < k is by at most 
1 greater than the number of blocks in x. 

Proof. Simple case analysis. □ 

Assume fc is even. Set x = 2^~^ A 2^“^ A 2^“® A .. A 2^. Then x has fc/2 
blocks. Let 

m 

X= ^(2** -2^*) (mod 2'=), 

t=i 

where 1 < it,jt < k,t = l,2,3...,m, be a shortest Arepresentation of x. Notice 
that / = and J = {ji, J 2 j niay be multisets now. 

1. If a, a A 1 G / or a, a G I then 2“ A 2“+^ (mod 2^) and 2“ A 2“ (mod 2^) 
has only one block. Starting with a, a A 1 or a, a and adding the remaining 
terms and noting Lemma 4, we can conclude that the number of blocks in x 
is at most 1 A 2(m — 1) > fc/2 which gives 2m > (fc A 2)/2. Similar analysis 
holds if / is replaced by J. 
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2. If a € / and b G J, a > b, exist, then 2“ — 2** has only one block and we 
continue as in the previous case. 

3. If the previous two cases do not occur, then it must hold that jm > jm-i > 

... > ji > im > im -1 > > h- Then clearly 



X = 



2k 



2 ^* + 2 ** = 2 ^ ^ + 2 ^ ^ + 2 ^ ^ 
t=l t=l 



.. + 2 \ 



This implies that it = 2t — 1, for t = 1, 2, ..., m and 2^ — X)™ i 2^* =2^ ^ + 
2'=-3 + ... + 22 '"+i. Then 2^‘ = 2^= - (2'=-i + 2'=-3 + ... + 22™+i) contains 

two consecutive ones in the binary representation, a contradiction. 

Let k be odd. Set x = 2^“^ + 2^~^ + 2*“® + .. + 2^ + 1. Then x has (fc + 1) /2 
blocks. Let 

m 

a;= (y]](2** -2^‘) + 2*“+i -3) (mod2'=), 

t=i 

where 0 < it,jt ^ k,t = 1,2, 3..., m, and 0 < im+i < fc be a shortest representa- 
tion of x. / = ...,im+i}, and J = {ji, ji, niay be multisets, again. 

Wlog assume that ii < Z 2 < •■. < im+i, and ji < j 2 < < jm- 

1. If there exists js € J s.t. tm-i-i > js, then the number 2*™+i — 2^® -|-2*i — 3, has 
3 blocks. Therefore, the total number of blocks in x is at most 3-|-2(m — 1) > 
{k + l)/2. This implies that the diameter is at least 2m -I- 1 > (fc -I- l)/2. 
Assume that the diameter equals (fc-l- 1)/2. Consider the number y = 2^~^ -G 
2fc-3 _|_ _|_ 22 ^ _ \')j2 blocks. Let 



y = y^(2*‘ - 2^*) (mod2'=), 

t=i 



be its shortest representation. Similarly, as in the case (fc even) we prove 
that the number of blocks in y is at most 1 -I- 2(m — 1) > (fc — l)/2, which 
implies 2m > (fc -I- l)/2. It means that the distance between the node 0 and 
y and between 0 and a; is (fc -I- l)/2. Consider the shortest paths between x 
and y in the graph. The length of this path is an odd number. Then the 3 
shortest paths 0 — x,x — y,y — 0 produce an odd-cycle, a contradiction, as 
our graph is bipartite. 

2. Finally assume that the previous case is not valid. If a, a -I- 1 G / or a,a G I 
or a, a + 1 G J , or a, a G J or a G I , a + 1 G J, we continue as in case fc even, 
l.(a) and conclude that the diameter is at least 2m -I- 1 > (fc-l- l)/2. Suppose 
that the diameter is (fc -I- l)/2. Consider the number y from the case fc odd 
1. Similarly, we prove that the distance between 0 and y is (fc -I- l)/2 and 
force an odd cycle, a contradiction. 

Hence, we may assume that jm > jm-i > > ji > im+i > im > im-i > 

... > ii and ji > im+i + 1- Then clearly 



x = 



2k 



2^'* -f 2** - 3 = 2'=-! -f 2'=-3 -H 2^-^ -f .. -f 2^ -h 1. 
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This implies that 2** - 3 = + ... + 2^ + 1 and 2^ - 

Y.7=i 2^* = 2'""^ + 2'=-3 + ... + 22"*. Then ^ 2^* = 2^= - (2'=-^ + 2'="3 + 
... + 22"*) contains two consecutive ones in the binary representation, a cont- 
radiction. 



5 Conclusions 

In spite of the fact that the Knodel graph is an important interconnection net- 
work and has important broadcast and gossip properties, its diameter has not 
been determined yet. In the present work we have shown that the diameter of 
the Knodel graph W 2 k is equal to [(fc -|- 2)/2]. Besides the Knodel graph there 
are two other examples of the minimum gossip and broadcast garphs on 2^ 
vertices: the well known hypercube and the recursive circulant graph [13]. In 
particular, the diameter of the recursive circulant graph is |"(3fc— l)/4]. The 
main reason for offering the recursive circulant graph as a new topology was its 
performance (number of vertices to diameter ratio) which could compete with 
the A:— dimensional hypercube. Since the Knodel graph ITfc. 2 '= has the diameter 
\{k + 2)/2], it could compete with both graphs. 

The more general question of determining the diameter of the Knodel graph 
Wn for any even n is still open. Open question is also how to find the shortest 
path for any pair of nodes of the Knodel graph. 

Finally, we note that our result (or rather proof technique) can be interpre- 
ted as a result from the area of computational arithmetic. In certain applications 
of modular arithmetic (for example in cryptography) it is important to quickly 
evaluate powers of the form where y is an element of a finite group and x is an 
integer exponent. One can speed up the computation by expressing a; as a short 
sum of terms of the form ±2*. In [2] a notion of optimal representation of this 
type is defined and a fast (linear time) algorithm to find such representations 
was constructed. In fact, the authors of [2] reduce the problem of finding shortest 
paths in an efficiently constructible directed acyclic graph. The latter problem 
can then be solved fast by Dijkstra’s well known algorithm. One of the conse- 
quences of our result is that the number of ±2* terms in such a representation 
of a /c— bit X is at most \k + 2/2'\. 
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Abstract. We introduce the domination search game which can be seen 
as a natural modification of the well-known node search game. Various re- 
sults concerning the domination search number of a graph are presented. 



1 Introduction 

Graph searching problems have attracted the attention of researchers from Di- 
screte Mathematics and Computer Science for a variety of nice and unexpected 
applications in different and seemingly unrelated fields. There is a strong resem- 
blance of graph searching to certain pebble games [13] that model sequential com- 
putation. Other applications of graph searching can be found in the VLSI theory: 
the game-theoretic approach to some important parameters of graph layouts 
such as cutwidth [17], topological bandwidth [16], bandwidth [9], profile [10] and 
vertex separation number [8] is very useful for the design of efficient algorithms. 
Also let us mention the connection between graph searching, pathwidth and tree- 
width. These parameters play a very important role in the theory of graph minors 
developed by Robertson and Seymour [2,7,20]. Some search problems also 
have applications in motion coordination of multiple robots [21] and in problems 
of privacy in distributed environments with mobile eavesdroppers (‘bugs’) [12]. 

In this paper, we introduce a domination search game which can be regarded 
as a natural modification of the well-known node search game. In node searching 
at every step some searchers are placed or are removed from vertices of a graph 
G. The purpose of searching is to find an invisible and fast fugitive moving from 
vertex to vertex in G. The searchers find the fugitive if some of them succeeded 
to occupy the same vertex as the fugitive. 

In the domination search game the searchers have more power, they find 
the fugitive if one of them can ‘see’ it, i.e. the fugitive stands on a vertex of 
the closed neighborhood of a vertex occupied by the searcher. For a survey on 
similar ‘see-catch’ problems on graphs with searchers having ‘radius of capture’ 
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Universitat Jena which was supported by a fellowship of the DAAD (Kennziffer: 
A/99/09594). 
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in different graph metrics as well as variants of ‘see-catch’ pursuit-evasion games 
on grids we refer to [11]. Domination searching can also be regarded as a version 
of the art-gallery problem with mobile intruder. 

The paper is organized as follows. In § 2 we give definitions and preliminaries. 
In § 3 we establish a relation between the domination search game and the well- 
known node search game. In this section we also observe some complexity results. 
In § 4 we discuss upper bounds for the domination search number that can be 
obtained by making use of spanning trees. In § 5 we present our main theorem 
which settles a very interesting connection between domination graph searching 
and a relatively new graph parameter called dominating target number. (Some 
proofs will be omitted due to space restrictions.) 

2 Preliminaries 

We use standard graph-theoretic terminology compatible with [3] to which we 
refer the reader for basic definitions. G = (V, E) is an undirected, simple (with- 
out loops and multiple edges) and finite graph with the vertex set V and the edge 
set E. We denote by G\W] the subgraph of G = (V,E) induced hy W QV. As 
customary we consider connected components (or short components) of a graph 
as maximal connected subgraphs as well as vertex subsets. A vertex set S' C y of 
a graph G is said to be connected if the subgraph of G induced by S is connected. 

We denote by G the complement of a graph G. The (open) neighborhood of 
a vertex v is N{v) = {u & V \ {u, u} G E} and the closed neighborhood of v 
is N[v] = N{v)U {v}. For a vertex set S C y we put N[S] = 

N{S) = A^[S] \ S. A vertex set D C y of a graph G = (V,E) is said to be 
a dominating set of G if for every vertex u € V \ D there is a vertex v G D 
such that {w,u} G E. Thus D is a dominating set iff N[D] = V. The minimum 
cardinality of a dominating set of a graph G is denoted by 7(G). We also say 
that A C y dominates B C V in the graph G = (V, E) ii B C A^[Aj. The 
distance da{u, v) between two vertices u and w of G is the length of the shortest 
path between u and v in the graph G. For a graph G = {V, E) let G^ be the 
graph with vertex set V and two vertices u and v are adjacent in G^ if and only 
if da{u, v) < k. 

For our purpose it is more convenient to describe the graph searching in terms 
of clearing graph vertices. Initially, all vertices are contaminated (uncleared or 
are occupied by invisible fugitive). A contaminated vertex is cleared once after 
placing a searcher on a vertex from its closed neighborhood. A clear vertex v 
is recontaminated if there is a path avoiding closed neighborhoods of vertices 
occupied by searchers leading from u to a contaminated vertex. 

More precisely, a domination search program II on a graph G = {V, E) is 
a sequence of pairs (also considered as the steps of II) {Dq, Aq), (Di, Ai), . . . , 
(L> 2m-1, A2m-i) such that 

1. for all i G {0, 1, . . . , 2m — 1}, Di CV and Ai CV] 

2. Do = 0, Ao = 0; 
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3. for every i G {1,2, . . . , m} at the (2i — l)-th step we place new searchers and 
clear vertices: I?2i-2 C -D2i-i and ^2i-i = ^2i-2 U -^[£'21-1]; 

4. for every i G {1, 2, . . . , m — 1} at the 2z-th step we remove searchers with 
possible recontamination: D 2 i-i D I?2z and A 2 i is the subset of A 2 i-i sa- 
tisfying that for every vertex v G A 2 i every path containing v and a vertex 
from V \ A 2 i-i contains a vertex from N[D 2 i\. If A 2 i-i \ A 2 i yf 0 we say 
that the vertices of A 2 i-i \ ^2* are recontaminated at the 2i-th step and that 
recontamination occurs at the 2i-th step. 

It is useful to consider Di as the set of vertices occupied by searchers and Ai as 
the set of cleared vertices after the ith step. Hence V\Ai is the set of contaminated 
vertices after the zth step. 

Notice that a program U = ((Do,^o), (£*1,^1), ■ • ■ , (-D2m-i, ^2m-i)) is fully 
determined by the sequence {Dq, Di, . . . , D 2 m-i) thus we shall mainly use the 
shorter description II = {Dq, ... , I?2 m-i)- All proofs in our paper could be given 
in terms of the formal definition of a search program U = {Dq, Di, . . . , D 2 m-i) 
but often we prefer a more ‘informal’ (but equivalent) way of proving. 

We call a domination search program 77 winning if ^2^-1 = V. A se- 
arch program 77 = {Dq, Di, . . . , D 2 m-i) is monotone if Ai C for all 

i G {0, . . . ,2m — 2}, i.e. no recontamination occurs. We say that a domination 
search program 77 uses k searchers if maxjg{o,...,2m-i} \Di\ = k. 

We define the domination search number by ds(G) = min/j maxjg{o,...,2m-i} \Di\ 
where the minimum is taken over all winning programs 77. 

Problem 1. Is there a graph G such that every monotone winning domination 
search program on G uses more than ds(G) searchers? In other words, does 
recontamination help to search a graph? ^ 

Notice that the domination search number of a disconnected graph G is 
equal to the maximum domination search number taken over all components of 
G. Thus we may assume our considered graphs to be connected. 

Obviously ds(G) < 7(G). To establish a more interesting relation, let G = 
(V,E) be a graph and let S'2(G) = (V (S 2 {G)) , E{S 2 {G))) be the graph obtained 
from G by replacing every edge {u, u} of G by an (u, z;)-path of length three. We 
call the vertices oiV C F (S'2 (G) ) original vertices and the vertices of H(S'2 (G) ) \ 
V middle vertices. 

Theorem 1. For any graph G = {V,E), let 77 be the graph with vertex set 
V{S 2 {G)) and edge set 7? U 7f(S'2(G)), i.e. 77 is obtained from G by connecting 
every two nonadjacent vertices by a path of length three. Then 

7(G) < ds(77) < 7(G) + 1 . 

Proof. Let 77 be a dominating set in G. We first put |77| searchers on the vertices 
of 77 in 77 and then using one additional searcher we clear all middle vertices of 
S'2(G). Hence ds(77) < 7(G) -I- 1. 

^ During discussions at WG’2000 Stefan Dobrev (Bratislava, Slovakia) provided an 
example showing that recontamination may indeed help. 
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Let us prove now that 7(G) — ! searchers cannot clear the graph H by showing 
that at every step of searching there exists a contaminated original vertex. In 
fact, let n = {Dq, Di, . . . , be a domination search program on the graph 

H using at most 7(G) — 1 searchers. |L*i | < 7(G) — 1 implies that there is at least 
one contaminated original vertex after the first step of 77. Suppose that for some 
i G {1,2 ,..., m} there is contaminated original vertex v after the (2t— l)-th step. 
Hence v ^ N[D 2 i-i]- If at any later step 2j — 1, j > i, for the first time since 
the (2z — l)-th step a searcher is placed on v or one of its neighbors (in 77), then 
there is an original vertex w ^ N[D 2 j-i]- Notice that either w is adjacent to v 
or there is a (v, rc)-path of length 3 in 77 with two interior middle vertices such 
that no vertex of this path belongs to N[D 2 j- 2 \- Therefore w is contaminated 
after step 2j — 1. □ 



3 Node Search Number 

The domination search game can be regarded as a generalization of the well 
known node search game, see [2] for a survey. In the node search game at every 
step some searchers are placed or are removed from vertices. Initially, all vertices 
are contaminated. The difference between node and domination searching is that 
a contaminated vertex v in node searching is cleared once after placing a searcher 
on V. A clear vertex v is recontaminated if there is a path avoiding vertices 
occupied by searchers leading from u to a contaminated vertex. 

The following theorem establishes a relation between node searching and 
domination searching. 

Theorem 2. For any graph G = {V, E), ns(G) = ds(S'2(G)). 

Since the problem node search number is NP-complete, Theorem 2 implies 

Corollary 1. The problem Domination Searching.' ‘Given a graph G and 
an integer k, decide whether ds(G) < k or not’ is NP-hard. 

Finally by combining a result on the non-approximibility of the domination 
number by a factor of clogn for some c > 0 [19] with Theorem 1 we obtain 

Corollary 2. There is a constant c > 0 such that there is no polynomial time 
algorithm to approximate the domination search number of a graph within a 
factor of clogn, unless P = NP. 

4 Spanning Trees 

Let S = {vi,V 2 , ■ ■ ■ , Vn) be a vertex ordering of a graph G = (V, E). The width of 
the ordering <5 of G is b(G, <5) := max{|7 — j\: {vi,Vj} G E}, and the bandwidth 
of G is bw(G) := min|b(G, 5) : (5 is a vertex ordering of G}. 

In the proof of the next theorem we use a theorem of Ando et al. [1] stating 
that for any tree T with I leaves bw(G) < \^l~\- We denote the set of leaves, 
i.e. vertices of degree 1, of a tree T by Vi{T). 
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Theorem 3. Let T = {V, E{T)) he a spanning tree of a graph G = {V, E) such 
that G C and let I be the number of leaves of the tree Ti = T — Vi{T). 

Then 



ds(G) < \^lMk + l) + l. 

Proof. Let S = (vi,V 2 , ■ ■ ■ ,Vr) be an ordering of the vertices of Ti such that 
b(Ti, i5) <b = \\V\. Note that such an ordering exists by [1]. 

Our domination search program works as follows. We put searchers on the 
first b{k+ 1) + 1 vertices of i5. Then we remove the searcher from v\ and place it 
on Uf,(fc+i)+ 2 - Suppose that after removing the searcher from v\ recontamination 
occurs. Hence there is a vertex x such that {a;,?;!} G E and there is a conta- 
minated vertex y such that {x, y} G E. Since searchers occupy all vertices with 
indices at most b{k -h 1) -h 1 unless vi, there is a vertex Vi, i > b{k -h 1) -I- 1, such 
that y G Nxlvi]. Now G C implies that the (x,y)-path in T has length at 
most k + 1. 

We distinguish three cases. In Case 1 we assume that x is a leaf of T and 
{x,vi} G E{T). Then the length of the (ui, t!i)-path in T is at most k since 
the (x,j/)-path in T passes through v\. In Case 2 we assume that x is a leaf of 
T and {x,xi} ^ E(T). Hence the neighbor of x in T is a vertex vj for some 
j > b{k -I- 1) -I- 1. G C implies dT{v\,x) < fc -I- 1. This implies that the 
length of the (xi,Xj)-path in T is at most k since the (vi,x)-path in T passes 
through Vj. Finally in Case 3 we assume that x is not a leaf of T. Therefore 
X = Vj for some j > b{k -I- 1) -I- 1. Consequently {vi,vj} G E and we obtain 
dT{vi,Vj) < fc -I- 1. 

Therefore in all three cases there is a j > b{k -I- 1) -I- 1 such that dT{vi,Vj) < 
fc -I- 1. By ‘pigeonhole principle’ there is an edge {vp,Vq} in the (vi,Xj)-path of 
T such that \p — q\> h which contradicts the choice of the ordering 5. 

By the same arguments we can remove a searcher (without recontamination) 
from V 2 and put it on Xh(fc_|_i)_|_ 3 , and so on. Finally, every vertex of G is cleared 
once by a searcher because T is a spanning tree of G. Since no recontamination 
occurs when a searcher is placed on all vertices of G are cleared. □ 

If the spanning tree T is a caterpillar then the results of Theorem 3 can be 
slightly improved. A caterpillar is a tree which consists of a path, called the 
backbone, and leaves adjacent to vertices of the backbone. 

Theorem 4. Let T he a spanning caterpillar of a graph G and let k be an integer 
such that G is a subgraph o/T^+^. Then ds(G) < max{2,fc}. 

Proof. The backbone P = (vi,V 2 , . ■ . , Vm) of T is a dominating path of G, i.e., 
every vertex of G is adjacent to a vertex of P. First we suppose fc > 2. Consider 
the following domination search program LI using fc searchers. Initially we put 
searchers on the first fc vertices of P. Then we remove the searcher from v\ and 
put it on Xfc+i, then remove searcher from V 2 and put it on Vk +2 and so on. 

Let us show that 7T is a monotone program. Assume the converse. Suppose 
that after removing a searcher from a vertex, say Vj, the first recontamination 
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occurs. Then there is a vertex x such that {x,vj} G E and {x,y} G E for some 
contaminated vertex y. Because y is not cleared yet, we conclude that y G Nxlvi] 
for some i > j + k — 1. Moreover, if y = Vi then i > j + k. Therefore the length 
of the {x, y)-path in T is at least k + 2. But this contradicts the definition of T; 
hence 77 is monotone. Since every vertex of G was once cleared by a searcher we 
obtain that 77 is winning. 

If fc < 1 then G C C C and two searchers are sufficient to clear 
G as shown above. □ 

An independent set of three vertices is called an asteroidal triple if every 
two of them are connected by a path avoiding the neighborhood of the third. 
A graph is AT-free if it does not contain an asteroidal triple. Asteroidal triples 
where introduced to characterize interval graphs and comparability graphs, see 
[4] for references. In their fundamental paper [6] Corneil et al. investigate AT- 
free graphs. Among others properties they prove that every connected AT-free 
graph contains a dominating pair, i.e. a pair of vertices u and v such that every 
{u, u)-path is dominating. A dominating pair of a connected AT-free graph can 
be detected by a simple linear time algorithm [5]. In [14] it was shown how to 
use a dominating path found by that algorithm as backbone of a caterpillar T 
such that T C G C T'^. We obtain the following corollary of Theorem 4. 

Corollary 3. Let G he an AT-free graph. Then ds(G) < 3. 



Conjecture 1. Let G be an AT-free graph. Then ds(G) < 2. 

For some classes of AT-free graphs we are able to prove that the domination 
search number of every graph in this class is at most two. 

A graph G is a comparability graph if and only if G has a transitive orienta- 
tion of its edges. Cocomparability graph are the complements of comparability 
graphs. Every interval graph is a cocomparability graph, and every cocompa- 
rability graph is AT-free, see [4] for references and a survey on different graph 
classes. 

Theorem 5. Let G be a cocomparability graph. Then ds(G) < 2. 

A graph isomorphic to K 13 is referred to as a claw, and a graph that does 
not contain an induced claw is said to be claw-free. 

Corollary 4. The domination search number of AT-free claw-free graphs is at 
most two. 

Parra and Scheffler [18] proved that for every AT-free claw-free graph G, 
ns(G) — 1 = bw(G). Combining this and Corollary 4 one can obtain the following 
interesting result. 



Theorem 6. Let G be an AT-free claw-free graph. Then 
iZ\(G) < bw(G) < 2A{G) - 1. 
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5 Dominating Targets 

We start with a result on graphs with dominating pair, a class of graphs con- 
taining as we have mentioned all connected AT-free graphs. Let us recall the 
definition of a dominating pair. A dominating pair is a pair of two (not necessa- 
rily different) vertices u and u of a connected graph G such that the vertex set 
of every (u, u)-path in G is a dominating set of G. 

Lemma 1. The domination search number of a connected graph with domina- 
ting pair is at most 4. 

Proof. Let c?i, ^2 be a dominating pair in G and let P be a shortest (di, d 2 )-path. 
Then P is the backbone of a spanning caterpillar T in G. Since P is a shortest 
path, we have that G C P®. Then by Theorem 4, ds(G) <4. □ 

Problem 2. Determine the maximum domination search number of a graph with 
dominating pair. 

Dominating targets have been introduced in [15] as a generalization of domi- 
nating pairs. A dominating target is a vertex set T C y of a connected graph 
G = (V,E) such that every connected superset of T is a dominating set. The 
dominating target number of a graph G, denoted by dt(G), is the smallest size of 
a dominating target of G. Hence graphs with dominating pair have dominating 
target number at most 2. The following theorem that extends Lemma 1 is one 
of the main results of our paper. To prove it we shall need some technical results 
about dominating targets as well as a notion of a dominating target for a vertex 
set of a graph. 

A vertex set P C y of a graph G = (y E) is a dominating target for a vertex 
set B CV in G if B C iV[S'] for every connected superset S of P. Clearly P is a 
dominating target of a graph G = (y E) if and only if P is a dominating target 
for y in G. 

Lemma 2. Let T be a dominating target for a vertex set B C V of a graph 
G = (y E) and let D be a vertex set. Then for any vertex v € B\ N[T U D] the 
following two statements hold. 

1. The number of connected components of G — iV[v] containing vertices of T 
is at least two. 

2. Let G\,G 2 , ■ . ■ ,Gk be the connected components of G — N[v] withTtlGi yf 0. 
For every i € {1, 2, . . . , fc} let yi € A^[u] be a vertex with N[yj] D Ci ^ 0. 
Then for every i G {1, 2, . . . ,k} the set Ti = (T D Gi) U {u} is a dominating 
target for the vertex set Bi = {B C\ Gi) \ N[Di] in the graph G[Gi U fV[u]], 
where Dt = {D C\ {Gi U A^[u])) U {u} U {yj : j yf i}. 

Proof. Let v £ B \ N[T U D]. Suppose there is a connected component G of 
G — N[v] with P C G. Then G is a connected superset of P that does not 
dominate the vertex v — a contradiction. Hence at least two components of 
G — N[v] contain vertices of P. 
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For the proof of the second statement suppose that Ti is not a dominating 
target for Bi in the graph G[Ci U Then there is a connected superset 

-5'i C Ci U -/V[n] of Ti and a vertex w & Bi that has no neighbor in Si. We extend 
Si to a connected superset S' of T in G by adding for every j ^ i the vertex 
i/j and (for simplicity) Cj. Hence w G B is not adjacent to a vertex of S which 
implies that T is not a dominating target for B in G — a contradiction. □ 



Theorem 7. For every connected graph G, ds(G) < 2dt(G) + 3. 

Proof. By induction on fc = |T|, we prove the following stronger statement that 
implies the theorem when taking a dominating target T of G such that |T| = 
dt(G), i.e. B = V and D = 0. 

Let T be a dominating target for B C V in the graph G = {V, E) and let 
D QV he a vertex set such that N[D] V\B. Then there is a winning 
domination search program II on G using at most |H| + 2|r| + 3 searchers 
such that at the first step of 77 searchers are placed on all vertices of 
D and after that on each vertex of D there is a searcher throughout all 
steps of the search program 77. 

If T is a dominating set of G[B] then T U 77 is a dominating set of G. Hence 
the statement is true since we simply place searchers on all vertices of TDD. 
Notice that \T\ = 1 implies that T is a dominating set of G[77]. 

If |T| = 2 then we place a searcher on each vertex of D and by Lemma 1, 
we can clear all vertices of B using 4 additional searchers to be moved along a 
shortest path P between the two vertices of T in G. 

Suppose inductively that for all 77 C H and all dominating targets T for B in 
G with N[D] ^V\B the statement is true if |T| < 7—1. Let T be a dominating 
target for 77 C y in the graph G = (V,E) with \T\ = k and let 77 C H be a 
vertex set such that TV [77] V \B. 

We may assume that T U 77 is a not a dominating set of G[77], thus S = 
77 \ TV[ru 77] 0. Then by the first part of Lemma 2, for every vertex v G S at 

least two components of G — TV[u] contain vertices of T. 

Consider first the easy case in which there is a vertex v G S such that every 
component of G — N[v] contains at least 2 vertices of T. Let Gi, G 2 , . . . , Gm, 
m > 2, he the components of G — N[v] with T D Gi yf 0 such that k — 2 > 
|Gi n Tj > ]G 2 n T] > . . . > \Gm n Tj > 2. For every j S {1,2,..., m} choose a 
vertex yj G TV[v] with N[yj] (iGj yf 0. 

The domination search program works as follows. We place searchers on all 
vertices of 77 and a searcher on v that will never be removed. Then the compo- 
nents Gi, G 2 , . . . , Gm are cleared individually one by one, where each component 
Gi is cleared as follows: 

Place searchers on all vertices yj with j ^ i. By the second part of Lemma 2, 
we may conclude that Ti = {T D Gi) U |u} is a dominating target for the vertex 
set Bi = ((77 n Gi) \ \Jj^i N[yj]) \ N[Di] in the graph G[Gi U N[v]] where A = 
(77n(GiUTV[ui]))U{u}U{2/y : j yf 7} satisfies TV[77i] 0 Gi\Bi. Since [GiflTj-l-l < 
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k — 1 the graph G[Ci U iV[w]] can be searched (resp. Ci can be cleared) using 
2{\Ci n r| + 1) + 3 + \Di\ searchers by our inductive assumption. 

Since the components are cleared individually and \Di\ < \D\ + 1 + (m — 1) 
we have that the number of searchers needed is at most \D\ + m + 2{\Ci nT| + 
1) + 3. Each component Ci contains at least two vertices of T and we obtain 
\Ci n T| + m < k. Combined with \C\ fl T| < fc — 2 this implies that the number 
of searchers is at most \D\ + 2k — 2 + b = \D\+2k + 3. 

Now we can concentrate our efforts on the hard case in which for every vertex 

V & S = B\N\T D] at least one component of G — N[v] contains exactly one 
vertex of T. 

We say that a vertex u G S' is t-separating in G for a vertex t € T ii there is 
a component G of G — such that G fl T = {t}. 

Suppose that the dominating target T for B in G contains a vertex t for 
which no vertex v G B is t-separating. Then T \ {t} is a dominating target for 
i?\fV[t] in the graph G. Thus placing a searcher on vertex t and then inductively 
using the claim for a dominating target of cardinality k — 1, we easily obtain 
that 1 -I- \D\ -I- 2 A: -I- 1 = \D\ + 2k + 2 searchers are sufficient to search G. 

From now on we may assume that for every vertex t G T there is a vertex 

V G S which is t-separating. Notice that every vertex v G S is t-separating 
for some t G T (otherwise we are in the easy case) and that a vertex v may 
be t-separating for various vertices of T. It will be convenient to assume that 
T = {ti,t2, . . . ,tfc}. 

For every i G {1,2,... ,k}, let B(ti) be the set of all vertices v G S that 
are t^-separating. Notice that R{ti) ^ 0 for all i G (1,2,... ,fc} and that 

For vertices a,b G R{ti) we say that a b if a and U are in one component 
of G — iV[6] but b and ti are not in one component of G — N[a]. For every 
t G {1, 2, . . . , k}, we choose a ^i-maximal element Vi, i.e. a maximal element of 
the partially ordered set (R{ti),~<i). (Notice that Vi = vj for i j is possible.) 
Let Ci be the component of G — containing ti, i.e. T fl G^ = {ti}. 

The domination search program works as follows. We place searchers on all 
vertices of D and on all vertices of {vi,V 2 , ■ ■ - Vk} that will never be removed 
throughout the search. 

Then the components Gi, G 2 , . . . , G^ will be cleared individually one by one, 
where each component Ci is cleared as follows. Let G{, G^, . . . , G),^. be all com- 
ponents of G — A^[ui] containing a vertex of T except the component Ci, thus 
nii < k — 1. For each component G*, j G (1,2,... ,rrii}, we choose a vertex 
y} G N[vi] n N{Cj). By the second part of Lemma 2 and similar to the easy 
case we may conclude that for every i G (1,2,... ,k}, {vi,ti\ is a dominating 
target for the vertex set Bi = {{B fl Ci) \ \Jj^iN[yj]) \ N[Di] in the graph 
G[Ci U IV[ui]] where Di = {{D fl {Ci U iV[wi])) U |ui} U {y} '■ j ^ i} satisfies 
N[Di] D Ci\ Bi. We place searchers on all the vertices y\,y\,... ,yln.. Finally 
as consequence of Lemma 1 (already obtained at the beginning of this proof), 
four additional searchers are sufficient to clear Ci. Finally we remove all searchers 
from y\,y\,... , y)^. . Altogether this clears Ci. 
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Hence all components C\,C2, ■ ■ ■ ,Ck can be cleared by using at most \D\+k+ 
(fc — 1)+4= |_D| + 2fc + 3 searchers. After clearing the components C\,C2, ■ ■ ■ ,Ck 
there remain \D\ + k searchers on G. The current set of cleared vertices is 
N[D] U Ui<*<fc C'i U Ui<»<fc^N- Since N[U] C C, and Ui<*<fc = ^ \ 
A^[T U D] we may conclude that all contaminated vertices belong to the set 
U = (Ui<.<fc m) \ (A'N U Ci)) \ N[D]. 

Let us show how the vertices from the set U can be cleared by placing k 
additional searchers on vertices of G. For every i G {1,2,... ,k}, we choose 
a vertex Ui G N[vi] having a neighbor in Gi. We claim that every vertex of 
U is adjacent to a vertex of the set {u\,U 2 ,--- ,Ufc} which would imply that 
additional k searchers placed on all vertices of {ui,U 2 ,--- ,Uk} together with 
the k searchers already placed on {rii, V 2 , ■ ■ ■ , Vk} will clear all vertices of U . In 
fact, choose a vertex w G U. 

Then w G R{1) for some I G (1,2, ...,fc}, thus w is ^-separating. Choose 
a (tpW/)-path P having all vertices in Gi except ui and vi. Since vi is a 
maximal element, vi does not belong to the component of G — N[w] containing 
ti and therefore w is adjacent to a vertex p of the path P. On the other hand, 
since w ^ GiU we may conclude that p ^ Gi. Therefore p = ui and the 
claim follows. 

Finally we place searchers on all vertices of |ui, U 2 , ■ ■ ■ Ufc} thus clearing ver- 
tices from U by our above claim. This needs at most 2k + \D\ searchers. 

Summarizing, the number of searchers needed is at most \D\ -|- 2fc -|- 3. This 
completes the proof. □ 



Corollary 5. For every disconnected graph G, ds(G) < 3 -I- 2 • maxcdt(G), 
where the maximum is taken over all components G of G. 

To illustrate the strength of Theorem 7 let us consider the complete graph G 
on n vertices and the graph 52 (G) obtained from G by replacing each edge 
by a path of length 3. Clearly dt( 52 (G)) = n and by Theorem 2 we have 
ds(52(G)) = n. This leads to the following conjecture which is up to our know- 
ledge the strongest possible strengthening of our theorem. 

Conjecture 2. ds(G) < dt(G) for all graphs G. 
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Abstract. We consider the problem of disseminating messages in net- 
works. We are interested in information dissemination algorithms in 
which machines operate independently without any knowledge of the 
network topology or size. Three communication tasks of increasing dif- 
ficulty are studied. In blind broadcasting (BB) the goal is to communi- 
cate the source message to all nodes. In acknowledged blind broadcasting 
(ABB) the goal is to achieve BB and inform the source about it. Fi- 
nally, in full synchronization (FS) all nodes must simultaneously enter 
the state terminated after receiving the source message. The algorithms 
should be efficient both in terms of the time required and the communi- 
cation overhead they put on the network. We limit the latter by allowing 
every node to send a message to at most one neighbor in each round. 
We show that BB is achieved in time at most 2n in any n-node network 
and show networks in which time 2n — o(n) is needed. For ABB we show 
algorithms working in time (2-|-e)n, for any fixed positive constant e and 
sufficiently large n. Thus for both BB and ABB our algorithms are close 
to optimal. Finally, we show a simple algorithm for FS working in time 
3n.The optimal time of full synchronization remains an open problem. 



1 Introduction 

Broadcasting is the task of disseminating a message from a source node to all 
other nodes of a communication network. The goal is to complete the task as 
fast as possible using a moderate amount of communication in the network. 

Broadcasting is an important and basic communication primitive in many 
multiprocessor systems. Application domains of broadcasting include scientific 
computation, network management protocols, database transactions, and multi- 
media applications. Due to their widespread use, broadcasting algorithms need 
to be efficient in terms of time and network communication. The dissemination 
of the source message should be accomplished quickly, without posing too heavy 
communication load on the network. 

We are interested in broadcasting (and more general information diffusion) 
algorithms which operate correctly and efficiently when there is no centralized 
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control in the network, and machines operate independently without any a priori 
knowledge of the network topology or even size. 

We assume that communication proceeds in synchronous parallel rounds. A 
round is the time necessary for a machine to transfer a message to a neighbor. A 
message sent by a node in a given round becomes available at the neighbor in the 
next round. The number of rounds necessary to carry out a given information 
dissemination task is called the time needed for this task. 

Under our hypothesis, even the size of the network is unknown. Therefore, 
a first problem is how the machines know that the dissemination process is 
completed. It could be very well finished but no node may be aware of this fact. 
This is different from the scenario when all necessary information about the 
network is available to nodes. In that case the time of broadcasting can be known 
in advance and thus all nodes can be aware of the termination of broadcasting 
as soon as it is completed. Since this is not the case in our setting, we study the 
following three information dissemination tasks of increasing difficulty. 

— Blind Broadcasting (BB): the goal is simply to communicate the source 
message to all nodes. 

— Acknowledged Blind Broadcasting (ABB): the goal is to achieve BB 
and inform the source that the process is terminated. 

— Full Synchronization (FS): all nodes must (simultaneously) enter the state 
terminated after receiving the source message. 

ABB may be crucial, e.g., when the source has several messages to disse- 
minate and nobody should learn the next message until everybody learns the 
previous one. FS implies common knowledge about completion of broadcasting 
(all nodes know that everybody got the message, they know that everybody 
knows that everybody got the message, etc.). It is the first time when a concer- 
ted action of all nodes can be undertaken after learning the source message. 

The BB, ABB, and FS communication tasks are closely related to some infor- 
mation diffusion problems which have been extensively studied in the literature. 
Blind broadcasting is related to Propagation of Information introduced in [17]. 
Acknowledged blind broadcasting is related to Propagation of Information with 
Feedback (PIF) [17], also called broadcasting with echo [18]. Full synchronization 
is related to the Firing Squad Problem (cf., e.g., [18]). 

The main difference between our approach and that in [1,17,18] is that while 
the latter focus on formal description and validation of protocols and on the 
message complexity for these tasks in the asynchronous setting, in the present 
paper we study the worst case number of rounds needed to complete these tasks. 

Moreover, we stress that the tasks described above are classically accomplis- 
hed by flooding the network, that is, by protocols in which nodes send at each 
step all the information they know to all of their neighbors (e.g. [1]). This im- 
plies that the communication complexity of such algorithms is very large, thus 
resulting in a significant communication overhead placed on the network. We 
study algorithms that place less overhead on the network while keeping low the 
number of rounds. 
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1 . 1 Related Work 

Coping with incomplete information in the design of network algorithms is an 
important problem whose many variations have been extensively studied. 

In particular, Topology Discovery and Network Measurement constitute a 
class of problems, recently studied in the literature, related to the need of gat- 
hering information about an unknown network structure; well studied tasks in 
this area are the discovery of the physical topology of the network or the router- 
level map of the Internet [4,10]. The above tasks are considered of fundamental 
importance for an efficient solution of many critical network management pro- 
blems. One general requirement for discovery algorithms is the need of being 
executed without imposing a significant overhead on the network. In particular, 
an important problem in this area, the Resource Discovery problem in which 
machines in a network initially unaware of each other need to learn both the 
existence of each other and the topology of the entire network, has been recently 
studied under communication assumptions similar to the ones we use in this 
paper [11]. Some of the techniques we develop apply also to the general problem 
of learning the topology of the network. 

Recently, a few papers [6,7,8] have been devoted to the study of broadcasting 
in networks in which neither edges nor nodes have a priori assigned labels, and 
thus in networks both anonymous and devoid of sense of direction. However, 
in these papers it was assumed that nodes know the topology of the network, 
although they lack some information necessary to orient it, and thus to carry 
out broadcasting in the most efficient way. 

While randomized broadcasting protocols have been studied under assump- 
tion that nodes are ignorant of the network (cf., e.g., [3,9]), deterministic bro- 
adcasting in point-to-point networks has been always considered assuming at 
least partial knowledge of the network (see the survey papers [13,14,15]). For 
example in [2] the number of messages needed for broadcasting was studied as- 
suming that nodes know at least their immediate neighborhood. To the best of 
our knowledge, ours is the first paper devoted to the study of time of determini- 
stic information dissemination algorithms in arbitrary point-to point networks, 
under network ignorance scenario (see [12] for similar investigations in specific 
topologies and [5] for similar investigations in arbitrary radio networks). 



1.2 Our Results 

Flooding based algorithms present the disadvantage of putting a significant com- 
munication overhead on the network. In order to reduce the number of messages 
sent in each round we will consider algorithms in which each machine is allowed 
to send a message to at most one of its neighbors in each round. We refer to the 
above reduced communication power of the algorithms as the restricted commu- 
nication model (cf. a similar assumption in [11]). We study the minimum time 
(i.e., number of rounds) needed for the BB, ABB, and FS tasks under this model. 

Since nodes are ignorant of network topology, they cannot distinguish bet- 
ween yet unused adjacent links and consequently the decision on which of them 
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the message should be sent belongs to the adversary, similarly as in [6,7,8]. We 
consider the worst case time, i.e., the time resulting from the most detrimental 
behavior of the adversary. 

For BB we show an asymptotically optimal algorithm. More precisely, we 
show that BB is achieved in time at most 2n in any n-node network and show 
networks in which time 2n — o(n) is needed for BB (and hence also for the 
other two tasks). For ABB we show algorithms achieving this goal in any n-node 
network in time (2 -|- e)n, for any fixed positive constant e and sufficiently large 
n. These algorithms use messages of logarithmic size. Using large messages, the 
source can additionally learn the entire topology of the network and time can be 
reduced to 2n. Finally, we show a simple algorithm for FS working in time 3n 
for any n-node network. This algorithm also uses messages of logarithmic size. 
We also mention that we can reduce this time to slightly less than 2.9n, using 
large messages, and simultaneously inform all nodes about the topology of the 
network. The modification details will be given in the full version of the paper. 

2 Terminology 

A network is modelled by a simple connected undirected graph with a distin- 
guished node called the source. The distance between two nodes is the length of 
the shortest path between them. The depth of the network is the largest distance 
between the source and any other node. The ith level of a network is the set of 
all nodes at distance i from the source. Clearly a node in level i can be adjacent 
only to nodes in levels i—l,i and i+1. Every node in level i is adjacent to a node 
in level i — 1. Call an edge saturated if the source message has been transmitted 
on it in at least one way. 

3 Blind Broadcasting 

The following observation will be essential in further considerations. 

Proposition 1. Accomplishing BB in a communication network is equivalent 
to transmitting the source message on each link in at least one direction. 

A proof can be found in [16]. A link on which the source message has been 
transmitted will be called saturated. Establishing an upper bound for BB is 
equivalent to showing that all edges of any network will be saturated in the given 
number of rounds, regardless of the behavior of the adversary, while establishing 
a lower bound is equivalent to constructing a network and showing an adversary 
strategy for it which prevents saturation of all edges before the given time. 

The following natural algorithm for BB is asymptotically optimal. 

Protocol BB 

Phase 1. The source sends its message on all adjacent edges in consecutive 
rounds starting in round 1. 

Phase 2. Every node v that got the source message sends the source message in 
consecutive rounds on all yet unsaturated edges (in any order) and then stops. 
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Theorem 1 . For any n-node network of depth d, Protocol BB accomplishes BB 
in worst case time 2n — d — 2. 

Proof. Let s be the source and Li, for i = 1, d, be the ith level of the network. 
Denote by Oi the size of Li. After oi rounds since the beginning of broadcasting, 
all edges between the source and Li are saturated. After at most oi — 1 + 02 
further rounds, all internal edges in Li and all edges between Li and L2 are 
saturated. After at most 02 — 1 + 03 further rounds, all internal edges in L2 and 
all edges between L2 and +3 are saturated. Proceeding in this way we conclude 
that after at most oi + (oi — 1 + 02) + (02 — 1 + 03) + • — h (od-i — 1 + ad) rounds, 
all edges of the network, except possibly the internal edges in Ld, are saturated. 
Further 0^—1 rounds are sufficient to saturate all these edges. Thus total time 
is at most 2(oi H + Od) — d = 2{n — 1) — d = 2n — d — 2. □ 

Theorem 2 . For any positive integer n and any algorithm for BB, there exists 
an n-node network and an adversary strategy for which this algorithm requires 
time 2n — o{n). 

Proof. We assume without loss of generality that no message is sent on a sa- 
turated edge. Whenever a node sends a message, the adversary chooses among 
unsaturated edges adjacent to v. 

We show the argument for n of the form 4 fc^. Modifications in the general 
case are easy. First we define the network. The set of all n nodes is divided into 
m = 2k disjoint sets Ai,...,Am of size m. Edges join every pair of nodes in Ai, 
for all i = 1, ..., m, and every pair u, v of nodes, such that u & Ai and v € 
for all z = 1 , ..., m — 1 . Let Ai = {xq, ..., Xm-i}- The source is Xq. 

Next we define the adversary strategy. In the ith round, i = 1, ...,m — 1 , all 
nodes xq, ...,Xi-i send a message to Xi. In the mth round every node Xi G Ai 
sends a message to a different node yi G A2. In the next m — 1 rounds the 
adversary strategy is as follows. In round m + j, j = I, ..., m — 1 , nodes Xi and 
Vi-i-ji mod m)5 * = 0 , ..., m — 1 , Send messages to each other. Hence, after 2m — I 
rounds since the start of broadcasting, saturated edges are exactly those between 
pairs of nodes in Ai and between Ai and A2. Moreover, informed nodes with 
adjacent unsaturated edges are exactly those in A2. 

It is well known that the set of edges of a complete graph of even size m can 
be partitioned into m — 1 pairwise disjoint perfect matchings. Let Mi,..., M^-i 
be these matchings for the clique on A2. In the further m — I rounds, nodes in 
A2 send messages to each other along edges from Mi in round {2m — 1 ) + z. The 
next m rounds are spent sending messages between A2 and A3, etc. In general, 
saturating all internal edges of Ai takes m—1 rounds, and saturating all edges 
between Ai and A^+i takes m rounds. Hence the total number of rounds is 
m{m — 1) + (m — l)m = 2mf — 2m = 2n — 2y/n. □ 

4 Acknowledged Blind Broadcasting 

In this section we describe a family of protocols achieving acknowledged blind 
broadcasting. Their aim is to broadcast the source message to all other nodes 




Efficient Communication in Unknown Networks 



177 



of the network, and inform the source that all nodes got the message. For any a 
priori given positive constant e, we show a protocol working in worst case time 
(2 + e)n, for n-node networks, where n is sufficiently large. The protocol uses 
messages of size at most log n and additionally informs the source of the number 
of nodes in the network. 

Let e be a positive constant fixed throughout this section. The protocol con- 
sists of two phases, the first of which is modified blind broadcasting and the 
second is the acknowledgement phase. The aim of the first phase is to broadcast 
the source message to all nodes, and simultaneously create a spanning tree of the 
network, of height equal to the depth of the network. Then information about 
completion of broadcasting and the number of nodes is efficiently sent to the 
source bottom-up using this tree. 

The tree is constructed dynamically, and the depth of each node in it decrea- 
ses during the execution of the protocol, to finally reach the level number of the 
node in the network. This guarantees efficient execution of the acknowledgement 
phase but requires updating labels which indicate the current depth, and upda- 
ting current parents. The above actions must be carefully scheduled to avoid 
long delays in spreading the source message. We show how this can be done so 
that the additional delay caused by these actions be at most en. 

Another problem is avoiding confusion among various versions of the con- 
structed tree. Nodes do not know when the tree is final and hence send infor- 
mation about completion of broadcasting and about size of parts of the network 
already using earlier versions of the tree. Thus messages concerning various ver- 
sions of the tree circulate simultaneously and there is an a priori danger of 
forgetting a node or counting it more than once. This could happen if infor- 
mation from different rounds were mixed: a node earlier counted in one branch 
could subsequently change parents and be also counted in a different branch. 
For the same reason a node might not be counted at all. To avoid this confusion 
we use “time stamps” indicating which version of the tree the message refers to. 
At the end, the source makes sure to consider a single version of the tree. 

Protocol ABB(e) 

Phase 1. Let c be an integer larger than 6/e. c is appended to the original 
source message and forms the message M which is disseminated. The source 
assigns itself label 0 and sends it together with M on consecutive yet unsaturated 
edges. When a node gets M for the first time from a node with label i, it assigns 
itself label i -I- 1 and sends it together with M on consecutive incident edges. 

The crucial idea is label updating. Whenever a node v gets a new label, it 
sends it (together with M), to all neighbors u from which it didn’t hear, or from 
which it heard that w’s label is higher than v’s current label. Such neighbors u 
will be called compulsory with respect to the current label of v. If a node has 
label i and hears from a node with label j < i — 1, it updates its own label to 
j -k 1. Thus labels of each node are decreased every time they change, and the 
smallest label a node can get is the number of its level. 

Label updating is combined with dynamic construction of a spanning tree. 
At every round every informed node has a single parent which can change several 
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times in the course of the protocol. The current parent of node v is its neighbor 
whose message caused v to modify its label most recently (or whose message 
woke up V, in case when v still has its first label). 

A node v acknowledges its current parent by sending to it a message “you 
became my parent” and cancels the old parent by sending to it a message “you 
stopped being my parent”. This is combined with label updating as follows. 

After getting a new label I, a node v starts process Spread(?) described as 
follows. In the first c — 1 rounds after getting label I, v sends this label (together 
with M) to consecutive compulsory neighbors. In the next two rounds, it first 
cancels the old parent, then acknowledges the new parent. In further rounds 
it continues sending its label to other compulsory neighbors. If during process 
Spread(/), v gets a new label I', Spread(l) is halted and v starts process Spread(^') 
from the beginning. 

A node involved in some process Spread(l) is called busy. Otherwise it is idle. 
Clearly, an idle informed node has all its incident edges saturated. 

A node considers as its current children all nodes from which it got an ack- 
nowledgement “you became my parent” that has not been cancelled afterwards. 

A node v that becomes idle at some round t' , starts a waiting period of c-l- 1 
rounds. If it does not change its label before the end of the waiting period, v 
becomes confirmed in round t = -I- c -I- 1. If at this round v does not have 

current children, v considers itself a leaf of the tree corresponding to round t, 
and enters Phase 2. (Notice that a confirmed node can subsequently loose its 
status by getting a new label. In this case it becomes busy, then idle, and then 
confirmed again.) 

Phase 2. Messages in phase 2 consist of a time stamp and of a content of the 
form “my subtree has size x” . Every confirmed node keeps a variable con whose 
value is the round number in which it became (most recently) confirmed. Such 
a node also keeps lists of all messages obtained from all current children. 

Phase 2 is started by confirmed nodes which consider themselves leaves of 
the tree corresponding to round t. In every round i > t such a node v sends a 
message “my subtree has size I”, with time stamp z — 1, to its current parent. 
This is done as long as v remains confirmed. 

In every round every confirmed node v that has current children looks at 
all lists of messages obtained from all of them. If there exists a common time 
stamp t' in all these lists, larger than the current value of w’s variable con, 
call the corresponding messages t'-healthy. Let t be the maximum of such t' . 
Node V takes all contents “my subtree has size xf' corresponding to all t-healthy 
messages from all its current children Cj, z = 1, ...r, and then sends the message 
“my subtree has size x\ + ■ ■ ■ + Xr + 1 '' with time stamp t to its parent. 

As soon as the source detects t-healthy messages from all its children Ui, 
i = 1, ..., fc, for some time stamp t, it knows that all nodes of the network got 

the source message, and that the total number of nodes is n = z/i H \-yk + l, 

where “my subtree has size yf” are the corresponding contents. 
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Theorem 3. Protocol ABB(e) correctly accomplishes acknowledged blind broa- 
dcasting and works in worst case time at most (2 + e)n, in any n-node network, 
for sufficiently large n. 

Proof. In order to prove the correctness of Protocol ABB(e), it is enough to show 
that detecting t-healthy messages by the source, for some time stamp t, implies 
that in round t broadcasting has been completed and this information reached 
the source bottom-up using the spanning tree frozen at time t. 

Indeed, according to the description of process Spread(l), a node may be ack- 
nowledged as a parent at most c-\- 1 rounds after becoming it. Hence, the waiting 
period imposed on a node v before it becomes confirmed, guarantees that every 
node w whose parent is v, is already counted by v among its current children. 
In particular, nodes considering themselves leaves of the tree corresponding to 
round t are indeed leaves of this tree. 

When a confirmed non-leaf v gets t-healthy messages “my subtree has size 
xf\ it knows that all of its children corresponding to round t reported the sizes 
of their subtrees corresponding to this round, and consequently v’s subtree in 
the tree corresponding to round t had size xi -I- • • • -k + 1- Node v reports 
this information (still with time stamp t) to its parent, that did not change 
since round t. Consequently, the rule of conveying information by non-leaves 
guarantees that only snapshots corresponding to the same round are relayed up 
the tree. Hence the total number of nodes computed by the source is indeed the 
number of nodes in the spanning tree corresponding to some round t, i.e., the 
number of nodes in the network. 

Note that between round t and the round when t-healthy messages are de- 
tected by the source, the tree could change many times. Nevertheless the source 
gets a correct and complete snapshot of the situation occurring in round t. 

It remains to show that the source will detect t-healthy messages for some 
t < (2 -k e)n. 

Call a node large if its degree is at least c, and small otherwise. According to 
the description of process Spread(^), the actions of cancelling and acknowledging 
parents do not delay blind broadcasting in small nodes, and each such action 
delays it by at most one round in large nodes. Let be the size of the t-th level 
of the network. Define the round 

Tj = Oi -k (ai — 1 -k 02 ) -k ... -k (fli-i — 1 -k tti) -k 2ji_i, 

where ji-i is the number of levels before the z-th that contain large nodes. 

By round all nodes in the z-th level may have changed their labels several 
times but they will certainly get the message from a neighbor in level z — 1, 
containing its final label z — 1. Hence such a node, in case when it is large, may 
only loose two further rounds to cancel the old parent and acknowledge the new 
one (which from now on will never change), and then it will pass its final (freshly 
updated) label z further. 

A computation similar to that in the proof of Theorem 1 shows that all edges 
will become saturated in round at most 2n-d-\-2x, where x is the number of levels 
containing large nodes. Clearly, x < Znjc. The waiting period is c -k 1. Hence 
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after the round 2n — d+2x + c+l, phase 2 must start, and at this point all labels 
are final, the spanning tree has height d (equal to the depth of the network) and 
never changes again. This implies that by round 2n — d+2x + c+l + d the source 
will detect t-healthy messages, for t = 2n — d + 2x + c + 1, if it has not detected 
such messages for smaller t at some earlier round. We conclude that Protocol 
ABB(e) works in worst case time at most 2n — d+2x+c+l+d < 2n+6n/c+c+l, 
which does not exceed (2 + e)n, for sufficiently large n. □ 

It should be noticed that at the time when the source learns that all nodes 
got the message in Protocol ABB(e), other nodes still send messages with new 
time stamps at each round. As mentioned in the introduction, ABB should be 
considered as one cycle in an ongoing process of broadcasting several messages, 
in which a new message should be disseminated only after everybody learned 
the previous one. When the source sends the next message, it simultaneously 
orders to stop all messages concerning the previous cycle. 

We conclude this section by describing a protocol working even faster than 
Protocol ABB(e) (in time at most 2n for an n-node network) and accomplishing 
much more. Upon its completion, all nodes get the source message, and the source 
learns not only that and the number of nodes, as in the previous case, but in fact 
learns the topology of the entire network. The drawback of this protocol is that 
the circulating messages are large, as opposed to logarithmic size in Protocol 
ABB(e). However, large messages are hard to avoid if the goal is to learn fast 
the entire topology. 

We present the protocol assuming that, when a node is informed, it gets 
a name, all names being distinct. In the full paper it will be shown how this 
naming procedure can be efficiently added to the protocol. 

The protocol works in two phases. Similarly as before, the aim of the first 
phase is to broadcast the source message to all nodes and simultaneously create 
a spanning tree of the network, of height equal to the depth of the network. 
This is done by a mechanism of label updating, as in Protocol ABB(e), with the 
exception that current parents are not acknowledged (this permits to save en 
time). In phase 2, nodes send all topological information they learned to date, 
up the tree, toward the source. 

Protocol Learn- All 

Phase 1. The source assigns itself label 0 and sends it together with the original 
message on consecutive yet unsaturated edges. Nodes get and update labels in 
the same way as in Protocol ABB. When a node v gets a new label, it sends it 
to all neighbors compulsory with respect to this label (i.e., to all neighbors u 
from which it didn’t hear or from which it heard that u’s label is higher than 
u’s current label). 

The current parent of a node is defined as before. Any node that got all 
of its edges saturated and sent its current label to all compulsory neighbors, 
enters phase 2. (As opposed to Protocol ABB, phase 2 is started by every node 
independently, not only by the leaves of the current tree.) 
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Phase 2. Every node v keeps a record CLK{v) called current local knowledge. 
It consists of the name v of the node, of its degree deg(f), and of a list of 
neighbors of v, initialized as empty. In the beginning of phase 2, every node v 
sends CLK{v) to its current parent. Whenever v gets a phase 2 message from 
a neighbor u, it updates CLK{v) by adding u to its list of neighbors and then 
sends its own CLK{v) together with all CLK{w) that it previously learned, to 
its current parent. In fact, nodes keep track of what they have sent to the current 
parent. If the parent did not change and v obtained no new information, it does 
not send anything in the given round. 

It should be noted that messages from phase 1 can still be sent after a node 
started phase 2. If v changes its label, it first sends it to all compulsory neighbors 
and then sends all available CLK{w) (together with its own) to the new parent. 

The source continuously updates all incoming records CLK(v) by adding new 
nodes to the respective neighbors lists, and applies the following termination 
rule. At each round, for any v for which it got a record CLK{v), the source 
counts the number N{v) of nodes u such that either u is in the neighbors list in 
CLK{v) or V is in the neighbors list in CLK{u). As soon as N{v) =deg(u) for 
all these v, the source concludes that all nodes in the network got the message 
and reconstructs the entire network as follows. The nodes of the network are 
precisely those nodes v for which it got records CLK{v), and nodes u and v are 
adiacent, iff either u is in the neighbors list in CLK(v) or v is in the neighbors 
list in CLK{u). 

Theorem 4. Upon completion of Protocol Learn- All, the source knows that all 
nodes got the original message and has a correct knowledge of the topology of 
the network. The protocol works in worst case time at most 2n, for any n-node 
network. 

5 Full Synchronization 

We present a simple protocol for FS working in worst case time at most 3n, for 
any n-node network. The advantage of this protocol is that it uses only messages 
of size logarithmic in n. 

The idea of the protocol is the following. In the first phase, blind broadcasting 
is accomplished and simultaneously a spanning tree (of uncontrolled depth and 
shape) is constructed. In the second phase, gossiping (i.e., all-to-all broadcasting) 
is done in this tree, thus informing all nodes that everybody got the source 
message, and permitting them to simultaneously enter the state terminated. 

Protocol Simple-FS 

Phase 1. [It is a simplified version of phase 1 of protocol ABB]. Nodes of the 
network perform blind broadcasting. When a node v gets the source message 
for the first time, and the sender of the message is neighbor u, v sends the 
message “you are my parent” to u in the next round, and then relays the source 
message on consecutive yet unsaturated edges, as usual. (As opposed to protocol 
ABB, parents never change.) If node v gets the source message for the first time 
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simultaneously from many neighbors, it chooses as its parent one of these nodes, 
arbitrarily. After getting the acknowledgement, u considers v as its child. The 
parent and all children of a node are called its tree neighbors. They are indeed 
its neighbors in the spanning tree which is being constructed. 

Phase 2. Phase 2 is started by nodes that got all their adjacent edges saturated 
and did not get the message “you are my parent” in the next round. In the 
special case when the source has degree 1, phase 2 is also started by the source, 
after it has send the message on its unique edge. The above nodes are leaves 
in the (unoriented) spanning tree constructed in phase 1. They start phase 2 
by sending the message “my subtree has size 1” to their unique tree neighbor. 
All other nodes v whose all adjacent edges are saturated, proceed as follows. 
Suppose that the tree neighbors of v are w, vi,..., Vk, and v got messages “my 
subtree has size x” from all nodes Vi, i = Then v sends the message 

“my subtree has size x\-\ + Xk + V' to node w. 

Any node that got messages “my subtree has size from all its tree neighbors 
Ui, i = l,...,fc, knows that all nodes have already got the source message and 
that the total number of nodes in the network is n = j/i + • • • + + 1 . It then 

waits until round 3n and enters the state terminated. 

Notice that the size of messages should only permit sending numbers smaller 
than n, thus size logarithmic in n suffices. 

Theorem 5. Protocol Simple-FS correctly accomplishes full synchronization 
and works in worst case time at most 3n, for any n-node network. 

Proof. Acknowledging the parent in phase 1 delays blind broadcasting by one 
round in every node, except the source. Hence, using the computation of the 
upper bound for BB from the proof of Theorem 1, we conclude that phase 1 of 
Protocol Simple-FS will be completed at the latest in round {2n — d—2) + d < 2n. 
Leaves need to wait at most 1 more round to learn that they are indeed leaves. 
Hence phase 2 is initiated by all leaves before round 2n. 

Phase 2 is completed in at most n further rounds because its duration is 
limited by the longest travel time between any pair of nodes. Hence before round 
3n every node knows that all nodes have already got the source message and 
knows n. Hence all nodes compute 3n before round 3n, and consequently are 
able to enter the state terminated simultaneously in round 3n. □ 

Our second protocol for full synchronization, Protocol Faster-FS, is slightly 
faster than Simple-FS - it works in worst-case time at most 2.9n, for sufficiently 
large n - but it uses large messages, similarly as Protocol Learn- All. Its additional 
advantage is that it informs all nodes about the entire topology of the network. 
Since the description and analysis of the protocol are fairly involved, we do not 
include them in this extended abstract. 

6 Conclusion and Open Problems 

We considered the number of rounds required for three communication tasks of 
increasing difficulty, performed by nodes entirely ignorant of the network. 
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The optimal time of full synchronization remains an open problem. However, 
it should be noted that if the source knows a priori the size n of the network, full 
synchronization can be easily achieved in time at most 2n. Indeed, it is sufficient 
to perform blind broadcasting, adding the starting time and the value n to the 
original message. In view of Theorem 1 all nodes get the message before round 
2n. Then, knowing n, they wait until round 2n and enter state terminated. 

It seems interesting to study also other communication problems, such as gos- 
siping (i.e., all-to-all broadcasting) or multicasting, under the network ignorance 
scenario. Other important questions concern the impact of partial knowledge of 
the network (e.g., nodes knowing only such parameters as the size, the diameter, 
or the maximum degree of the network) on the efficiency of our communication 
protocols. 
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Abstract. We present the first efficient algorithm for a coarse grained 
multiprocessor that colors a graph G with a guarantee of at most Ac + 1 
colors. 



1 Introduction and Overview 

The problem of graph coloring is crucial both for the applications of graph algo- 
rithms to real world problems and for the domain of parallel graph algorithms 
itself. For the latter, graph colorings using a bounded number of colors are often 
used in a theoretical setting to ensure the independence of tasks that are to be 
accomplished on the vertices of a graph, i.e. since the color classes form inde- 
pendent sets that don’t interact each one of them can be treated in parallel. For 
a long time, no efficient parallel implementation of a graph coloring heuristic 
with good speedups was known [1]. However, in a recent result, Gebremedhin 
and Manne [4,5] present an algorithm and an implementation for a shared me- 
mory eomputer that proves to be theoretically and practically efficient with good 
speedups. 

In this paper we make this successful approach feasible for a larger variety of 
architectures by extending it to the more general setting of coarse grained multi- 
processors (GGM) [3]. This model of parallel computation makes an abstraction 
of the interconnection network between the processors of a parallel machine (or 
network) and tries to capture the efficiency of a parallel algorithm using only 
a few parameters. Several experiments show that the GGM model is of prac- 
tical relevance: implementations of algorithms formulated in the GGM model in 
general turn out to be feasible, portable, predictable and efficient [7] . 

This paper is organized as follows. In the next section we review the coarse 
grained models of parallel computation and the basics of graph coloring heu- 
ristics. Then, we present our algorithm together with an average case analysis 
of its time and work complexity. Finally, we show how to handle high degree 
vertices and how to alter the algorithm to achieve the same good time and work 
complexity also in the worst-case. 



U. Brandes and D. Wagner (Eds.): WG 2000, LNCS 1928, pp. 184-195, 2000. 
© Springer- Verlag Berlin Heidelberg 2000 




Graph Coloring on a Coarse Grained Multiprocessor 185 



1.1 Coarse Grained Models of Parallel Computation 

In recent years several efforts have been made to define models of parallel 
(or distributed) computation that are more realistic than the classical PRAM 
models. In contrast to the PRAM, these new models are coarse grained, i.e. 
they assume that the number of processors p and the size of the input N of an 
algorithm are orders of magnitudes apart, p N. By that assumption these 
models map much better on existing architectures where in general the number 
of processors is at most some thousands and the size of the data that are to be 
handled goes into millions and billions. 

This branch of research got its kick-off with Valiant [8] introducing the so- 
called bulk synchronous parallel machine, BSP, and was refined in different di- 
rections for example by Culler et al. [2] to LogP, and by Dehne et al. [3] to 
CGM. 

We place ourselves in the context of CGM which seems to be well suited for 
a design of algorithms that are not too dependent on an individual architecture. 
We summarize the assumptions of this model as follows: 

~ All algorithms perform in so-called supersteps. A superstep consists of one 
phase of interprocessor communication and one phase of local computation. 

— All processors have the same size M = 0{N/p) of memory. 

— The communication network between the processors can be arbitrary. 

The goal when designing an algorithm in this model is to keep the individual 
workload, time for communication and idle time of each processor within T /s{p) 
where T is the runtime of the best sequential algorithm on the same data and 
s{p), the speedup, is a function that should be as close to p as possible. To be 
able to do so, it is considered as good idea to keep the number of supersteps of 
such an algorithm as low as possible, preferably o{M). 

The rationale for that is that for the communication time there are at least 
two invariants of the architecture that come into play: the latency, i.e. the mi- 
nimal time a communication needs to startup before any data reach the other 
end, and the bandwidth, i.e. the overall throughput per time unit of the commu- 
nication network for large chunks of data. In any superstep there are at most 
0{p) communications for each processor and so a number of supersteps of o{M) 
ensures that the latency can be neglected for the performance analysis of such an 
algorithm. The bandwidth restriction of a specific platform must still be obser- 
ved, and here the best strategy is simply to reduce the communication volume as 
much as possible. See Guerin Lassous et al. [7] for an introduction and overview 
on algorithms, code and experiments within the coarse grained setting. 

As a legacy from the PRAM model it is usually assumed that the number 
of supersteps should be polylogarithmic in p, but there seems to be no real 
world rationale for that. In fact, no relationship of the coarseness models to the 
complexity classes NC^ have been found, and algorithms that simply ensure a 
number of supersteps that are a function of p (and not of N) perform quite well 
in practice [6]. 




186 



A.H. Gebremedhin et al. 



To be able to organize the supersteps well it is natural to assume that each 
processor can keep a vector of p-sized data for each other processor and so the 
coarseness requirement translates into 

< M K. N/p (1) 



1.2 Graph Coloring 

A graph coloring is a labeling of the vertices of a graph G = (V, E) with posi- 
tive integers, the colors, such that no two neighbors obtain the same color. An 
important parameter of such a coloring C is the number of colors x(C') that 
the coloring uses. This can be viewed equivalently as searching for a partition of 
the vertex set of the graph into independent sets. Even though coloring a graph 
with the fewest number of colors is an NP-hard problem, in many practical and 
theoretical settings a coloring using a bounded number of colors, possibly far 
from the minimum, may suffice. Particularly in many PRAM graph algorithms, 
a bounded coloring (resp. partition into independent sets) is needed as a sub- 
routine. However, up to recently no reasonable coloring had shown to perform 
well in practical parallel settings. 

One of the simplest but quite efficient sequential heuristics for coloring is the 
so-called list coloring. It iterates over v € V and colors v with the least color 
that has not yet been assigned to one of its neighbors. It is easy to see that this 
heuristic always uses at most Aq + 1 colors, where Aq = max„gy{degree of f }. 
A parallelization of list coloring has been shown by Gebremedhin and Manne [5] 
to perform well both in theory and by experiment on shared memory machines. 
They show that distributing the vertices evenly among the processors and run- 
ning list coloring concurrently on the processors, while checking for color compa- 
tibility with already colored neighbors, creates very few conflicts: the probability 
that two neighbors are colored at exactly the same instance of the computation 
is quite small. Their algorithm was proven to behave well on expectation and 
the conflicts could easily be resolved thereafter with very low cost. 



1.3 The Distribution of Data on the Processors 

Even more than for sequential algorithms, a good organization of the data is 
crucial for the efficiency of parallel code. We will organize the graphs that we 
handle as follows: 

— Every processor Pi is responsible for a specific subset Ui of the vertices. For 
a given vertex v this processor is denoted by Py. 

— Every edge is represented as arcs (v,w), stored at and (w,v), 

stored at Py,. 

— For every arc (v,w) processor stores the identity of Py, and thus the 
location of the arc (w,v). This is to avoid a logarithmic blow-up due to 
searching for Py,. 

— The arcs are sorted lexicographically and stored as a linked list per vertex. 
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For convenience, we will also assume that the input size N for any graph al- 
gorithm is in fact equal to the amount of edges \E\. Up to a constant which 
corresponds to the encoding size of an edge this will always be achieved in gra- 
phs that don’t have isolated vertices. If the input that we receive is not of the 
desired form it can be efficiently tranformed into one by the following steps: 

— Generate two arcs for each edge as described above, 

~ Radix sort (see Guerin Lassous et al. [7] for a GGM radix sort) the list of 
arcs such that each processor receives the arcs (u, w) if it is responsible for 
vertex w, 

— Let every processor note its identity on these sibling arcs, 

— Radix sort the list of arcs such that every processor receives its proper arcs 
(arcs (v,w) if it is responsible for vertex v). 

On a somewhat simplified level, the parallelization of list coloring for shared 
memory machines as given by Gebremedhin and Manne [5] worked by tackling 
the list of vertices numbered from 1 to n in a ‘round robin’ manner: at a given 
time 1 < t ^ processor Pi colors vertex {i — 1) ■ - + t. The shared memory 
assumptions ensure that Pi may access the color information of any vertex at 
unit cost of time. The only problems that can occur are with the neighbors that 
are in fact handled at exactly the same time. Gebremedhin and Manne [5] show 
that the number of such conflicts is small on expectation, and that they can 
easily be handled a posteriori. In a coarse grained setting we have to be more 
careful about the access to data that are situated on other processors. 

2 The Algorithm 

As the best sequential algorithm for an approximate graph coloring is linear in 
the size of the graph |G|, our aim is to design a parallel algorithm in GGM with 
a work (total number of local computations) per processor in 0(-^) and overall 
communication costs in 0(|G|). 

Algorithm 1 colors any graph G such that Aq < M recursively. Section 3 
shows how to get rid of high degree vertices. The first type of high degree ver- 
tices are the vertices v such that deg{v) > The second type of high degree 
vertices are the vertices v such that ^ < deg{v) < Note that the first call to 
Algorithm 1 is applied on the whole graph G {C = G). 

In Algorithm 1 , we group the vertices that are to be handled on each processor 
into k different timeslots, see the figure and group vertices in the algorithm. 
The number 1 < fc < p of timeslots is a parameter of our algorithm. For each 
such timeslot we group the messages to the other processors together, see send 
to neighbors and receive from neighbors, and thus in general we produce 
few, even if possibly large, messages. Figure 1 shows a graph on 72 vertices 
distributed onto 6 processors and 4 timeslots. 

A naive parallel list coloring will respect adjacencies between timeslots, but 
may violate adjacencies listed in bold, inside a timeslot. Our algorithm avoids 
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Fig. 1. Graph on 72 vertices distributed onto 6 processors and 4 timeslots. 



this by coloring the bold subgraphs recursively until they are small enough to 
fit onto a single processor. In the recursive call, a vertex will belong to the 
same processor but probably to a different timeslot. Note that in general some 
processors may receive more vertices than others. 

By proceeding like this, we may generate more conflicts than in the shared 
memory algorithm; we may accidentally color two neighbors in the same timeslot 
with the same color. We find it convenient to resolve these conflicts beforehand, 
see identify conflicts and resolve conflicts, by a recursive call to the same 
algorithm. We must ensure that these recursive calls do not produce a blow-up 
in computation and communication, and in fact the main part of the technical 
details in the paper are dedicated to ensure this. 

In the recursive calls we must handle the restrictions that are imposed by 
previously colored vertices. We extend the problem specification and assume 
that a vertex v also has a list Fy of forbidden colors that is initially empty. 
An important issue for the complexity bounds will be that we will only add 
individual forbidden colors to Fy as the knowledge about them arrives on Py. 
The list Fy as a, whole will only be touched once, namely when v is finally colored. 

Observe also that the recursive calls in line resolve conflicts are not issued 
synchronized between the different processors: it is not necessary (nor desired) 
that the processors start recursion at exactly the same moment in time. When 
the calls reach the communication parts of the algorithm during recursion, they 
will be synchronized automatically when waiting for the data of each other. ^ 



^ In fact, when writing this the authors got more and more convinced of the impor- 
tance of asynchronicity between those of us working in Europe and the one visiting 
Australia. 
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Algorithm 1: List coloring on a CGM with p processors 

Input: Subgraph G' = {V',E') of a base graph G = (V,E) with M' 
edges per processor such that Aq' < M' , M the initial input size 
per processor and lists of forbidden colors for the vertices. 
Output: A valid coloring of G' with at most Aq + 1 colors. 

bottom: 

if the size of G' is less than then solve the problem sequentially on 

processor Pi; 

else 

high degree: 

Get rid of high degree vertices of second type (see Section 3); 

group vertices: 

Each processor Pi groups its vertices Ui into k timeslots Ui^t, t = 
1, . . . ,k, such that the degree sum in all timeslots is about the same; 
For each vertex v denote the index of its timeslot by 
for t = 1 to k do 

foreach processor Pi do 

identify conflicts: 

Gonsider all arcs e = (u, w) with v € Ui^t and G = tw = t; 
Name this set Si and consider the vertices Vs^ that have such 
an arc; 

resolve conflicts: 

Recursively color the graph (IJ U 'S'i); 

color timeslot: 

foreach uncolored vertex v with ty = t do color v with least 
possible color; 

send to neighbors: 

foreach arc (v,w) with v € Ui, E = t and ty, > t do collect 
the color of v for Py, in a send buffer; 
send out the tuples (w, color); 

receive from neighbors: 

receive the colors from the other processors; 
foreach arc {v,w) with w € Ui, ty = t and ty, > t do add the 
1 _ color of V to Fy,; 



Another issue that we have to face is the possible degree variation among 
the vertices. Whereas for the shared memory algorithm, different degrees of 
the vertices that are handled in parallel just causes a slight asynchronicity of 
the execution of the algorithm, in a GGM setting it might result in a severe 
imbalance of charge and even in a memory overflow of individual processors. We 
will see in Section 3 how to handle this. 
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2.1 An Average Case Analysis 

To get a high level understanding of the algorithm, let us first assume that we 
deal with the best of all worlds, that is: 

— we never have high degree vertices of first and second type (d is a high degree 

if d > ^), ^ 

— the events of having an edge between a pair of vertices are all random and 
totally independent of each other. 

In Section 3, we show how to handle high degree vertices and in Section 4 
we show what processing we have to add if we don’t assume randomness for the 
edges. 

Lemma 1. Consider Algorithm 1. For any edge {u, w}, the probability that ty = 
tw i^ p ■ 

Proof. The expected size of the degree sums of the timeslots Ui^t is the same. 
Since there are k timeslots, when fixing the timeslot of v the probability that w 
is in the same timeslot on its processor is □ 

Lemma 2. The expected size of all subgraphs over all t = 1, . . . ,k in resolve 
conflicts is N/k. 

Proof. Every processor has M edges so each timeslot is expected to contain M/k 
edges, M/fc^ of which may create conflicts. So on each processor, considering all 
the timeslots, we expect M/k conflict edges and in total N/k. □ 

Lemma 3. For any value 1 < k < p the expected number of supersteps is linear 
in p. 

Proof. Each call can initiate k recursive calls. The maximal recursion depth of 
our algorithm is the minimum value d such that N/k‘^ < M = N/p, i.e. k’^ > p, 
i.e. d = [logj,p]. The total number of supersteps in each call is c- k, for some 
constant c, one for each timeslot plus some to get rid of high degree vertices. 
Then, the total number of supersteps on recursion level i is given by c • fc* and 
so the total number of supersteps is 

riogfc p] 

c-N ^c-k^°SkP = c-p (2) 

i=l 

□ 

Lemma 4. Besides the cost for bottom and high degree, for any value of 
1 < k < p the expected work and communication per processor is 0{M). 

Proof. First we show that the work and communication that any processor has 
to perform is a function in the number of its edges, i.e. M . 
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Algorithm 2: Solve the problem sequentially on processor P\ 

Input: M the initial input size per processor, subgraph G' = (V', E') of a base 
graph G — {V, E) with \E'\ < M and lists Ev of forbidden colors for 
the vertices. 

collect colors: 

foreach processor Pi do 

Let Ui = UidV' be the vertices that are stored on Pi; 

For each v G Ui let d{v) be the degree of u in G'; 

Compute a sorted list of the least d(v) + 1 allowed colors for v; 

Communicate E' and all lists A-u to Pi; 

solve sequentially: 

for processor P\ do 

Collect the graph G' together with the lists 
Color G' sequentially; 

Send the resulting colors back to the corresponding processors; 

retransmit colors: 

foreach processor Pi do 

Inform all neighbors of Ui of the colors that have been assigned; 

Receive the colors from the other processors and update the lists Ev accor- 
dingly; 



Inserting new forbidden colors into an unsorted list Fy can be done in con- 
stant time per color. Any edge adds an item to the list of forbidden colors of 
one of its end- vertices at most once, so the size of such a list is bounded by the 
degree of the vertex. Thus, the total size of these lists on any of the processors 
will never exceed the input size M. 

To find the least available color in color timeslot we then have to process 
the list as a whole. This happens only once for each vertex v, and so one might 
hope to get away with just sorting the list Fy. But sorting here can be too 
expensive, comparison sort would impose a time of M log M whereas counting 
sort would lead to \Fy\+ M per vertex. 

But nevertheless the work for this can be bound as follows. Each processor 
maintains a Boolean vector colors that is indexed with the colors and that will 
help to decide for a vertex v on the least color to be taken. Since we got rid of 
high degree vertices we know that no list Fy will be longer than M/k and so a 
length oi M/k + 1 suffices for colors. 

Later, when relaxing this condition in Section 3 we will need at most p colors 
for vertices of degree greater than N/p and ensure to add no more than Z\' -|- 1 
colors, where A' is the maximum degree among the remaining vertices (A' < M). 

In total this means that we have at most p + M + 1 colors and so our vector 
colors still fits on a processor. This vector is initialized once with all values 
“true”. Then when processing a vertex v we run through its list of forbidden 
colors and set the corresponding items of colors to “false”. After that, we look 
for the first item in colors that still is true and choose that color for v. Then, 
to revert the changes we run through the list again and set all values to “true” . 
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Algorithm 3: Compute the allowed colors Ay of a vertex v. 

Input: V together with its actual degree d(v) and its (unordered) list Fy of 
forbidden colors; A Boolean vector colors with all values set to true. 

foreach c G Fy do Set colors\c\ = false; 

for (c = 1; \ Ay\ < d(v); + + c) do if colors[c] then Ay = c + 
foreach c G Fy do Set color s\c\ = true; 



This then clearly needs at most a time of p + M + 1 plus the sizes of the list, so 
0{M) time in total. 

As seen above, on any processor the total fraction of edges going into re- 
cursion is expected to be M/k, so the Main Theorem for divide and conquer 
algorithms shows that the total costs are 0{M). □ 



2.2 The Bottom of Recursion 

At first one might be tempted to think that the bottom of the recursion should 
easily stay within the required bounds and only communicates as much data as 
there are edges in the corresponding subgraph. But such an approach doesn’t 
count for the lists Fy of forbidden colors that the vertices might already have 
collected during higher levels of recursion. The size of these lists may actually 
be too large and their union might not fit on a single processor. To take care of 
that situation we proceed in three steps, see Algorithm 2. 

In the first step collect colors, for each vertex v G V we produce a short 
list of allowed colors. In fact, the idea is that when we color the vertex later 
on we will not use more than its degree -1-1 colors so a list of d{v) + 1 allowed 
colors suffices to take all restrictions of forbidden colors into account. With the 
same trick as in the previous section, we can get away with a computation time 
of |F„| -I- d{v) to compute the list Ay, see Algorithm 3. It is also easy to see 
that we can use this trick again when we sequentially color the graph on P\ . We 
summarize: 

Lemma 5. The bottom of the recursion for graph G' = {V ,E') with lists of 
forbidden colors Fy can be done in a constant number of communication steps 
with overall work that is proportional to |G'| and the lists Fy and with a com- 
munication that is proportional to |G'|. 

Note that there will be k calls to bottom, therefore Pi handles at most 

edges and ^ < kp-^ = M. That implies a 

total time for bottom of 0{M). 

3 Getting Rid of High Degree Vertices 

Line group vertices of Algorithm 1 groups the vertices into k < p timeslots of 
about equal degree sum. Such a grouping would not be possible if the variation 
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Algorithm 4: Get rid of high degree vertices of second type, 
foreach processor Pi do 

find all V £ Ui with degree higher than M' /k (Note: all degrees less than 
N/p)- 

send the names and the degrees of these vertices to Pi; 
for processor Pi do 

Receive lists of high degree vertices; 

Group these vertices into k' <k timeslots Wi, Wy of at most p vertices 
each and of a degree sum of at most 2Njp for each timeslot; 

_ Communicate the timeslots to the other processors; 
foreach processor Pi do 

Receive the timeslots for the high degree vertices in Up, 

Communicate these values to all the neighbors of these vertices; 

Receive the corresponding information from the other processors; 

Compute Et,i for t = 1, ... ,k' where one endpoint is in Up, 
for t — \ to k' do 

foreach processor Pi do Communicate Et,i to processor Pi; 
for processor Pi do 

Receive Et = Ui<i<p-®'*h denote by Gt ~ {Wt,Et) the induced 
subgraph of high degree vertices of timeslot t; 

Solve the problem for Gt sequentially, see Algorithm 2; 



in the degrees of the vertices is too large. For example, if we have one vertex of 
very large degree, it would always dominate the degree sum of its timeslot and 
we cannot achieve a balance. So we will ensure that the degrees of all vertices 
is fairly small, namely smaller than M/k. Observe that this notion of ‘small’ 
depends on the input size M and thus the property of being of small degree 
may change during the course of the algorithm. This is why we have to have the 
line high degree in every recursive call and not only for the top level call. On 
the other hand this choice of M/k will leave us enough freedom to choose k in 
the range of 2, ... ,p as convenient. 

We distinguish two different kinds of high degree vertices. The first type we 
have to handle in a preprocessing step that is only done once on the top level 
of recursion. These are vertices for which the degree is greater than M = N/p. 
In fact, these vertices can’t have all their arcs stored at one processor alone. 
Clearly, overall we can have at most p such vertices, otherwise we would have 
more than N edges total. Thus the subgraph induced by these vertices has at 
most p^ edges. Because of (1) this induced subgraph fits on processor Pi and a 
call to Algorithm 2 in a preprocessing step will color it. 

The second type of high degree vertices, that we indeed have to treat in each 
recursive call, are those vertices v with N/{pk) = M' /k < deg{v) < M' = N/p, 
see Algorithm 4. Every processor holds at most k such vertices, otherwise it 
would hold more than {M' /k) ■ k = M' edges. So in total there are at most p ■ k 
such vertices. 
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Algorithm 5: Determine an ordering on the k = 2 timeslots on each pro- 
cessor^ 

foreach processor Pi do 

foreach edge {v, w) do inform the processor of w about the timeslot of v; 
for s = 1 . . .p do 
|_ for r, r' = 0, 1 do set mil ~ 0; 

foreach edge (v, w) do add 1 to mil ) where Pa is the processor of w and 
r and r' are the timeslots of v and w; 

Broadcast all values mil for s = 1, • • • ,p to all other processors; 
inv[l] = false-, 
for s = 2 to p do 
A" = 0; A"" = 0; 
for s' < s do 

if -iinv[s'] then 

AX =AX +m°l,+ml°, 

else 

All=All+m^h+mL°,; 

[ I +m°°,+mll, 

if AX < All then inv[s] = true-, 
else inv[s] = false; 



It is again easy to see that the probability for a vertex v to become a high 
degree vertex of second type is small: even if it actually has M' /k edges, with 
high probability only have their other endpoint in the same timeslot, and 

so V will not become of high degree on the next recursion level. So on expecta- 
tion Algorithm 4 will only contribute little to the total number of supersteps, 
workload and communication. 



4 An Add-On to Achieve a Good Worst-Case Behavior 

So far for a possible implementation of our algorithm we have a degree of freedom 
in the number of timeslots k. If we are heading for just a guarantee on expectation 
as shown above we certainly would not like to bother with recursion and can 
choose k = p. This gives an algorithm that has 3p supersteps. 

To give a deterministic algorithm with a worst case bound we choose the 
other extreme, namely k = 2. This enables us to bound the number of edges 
that go into the recursion. We have to distinguish two different types of edges: 
edges that have both endpoints on the same processor, internal edges, and those 
that have them on different processors, external edges. 

Here we only describe how to handle external edges. In fact, the ideas for 
handling internal edges during the partition of the vertices into the two different 
timeslots are quite similar, but we omitted them for this short abstract. 
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To handle the external edges we add a call to Algorithm 5 after group 
vertices in Algorithm 1. This algorithm counts the number of edges bet- 
ween all possible pairs of timeslots on different processors, and broadcasts these 
values to all processors. Then a quick iterative algorithm is executed in parallel 
on all processors that decides on the processors for which the roles of the two 
timeslots are interchanged. 

After having decided whether or not to interchange the roles of the times- 
lots on processors Pi, . . . we compute two values for processor Pf. the 

number of edges that would go into recursion if we would keep the role of the 
two timeslots, and the same number if we would interchange the role of the 
two timeslots. 
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Abstract. We proof that every graph of clique- width k which does not 
contain the complete bipartite graph Kn,n for some n > 1 as a subgraph 
has tree-width at most 3fc(n — 1) — 1. This immediately implies that 
a set of graphs of bounded clique-width has bounded tree-width if it is 
uniformly 1-sparse, closed under subgraphs, of bounded degree, or planar. 



1 Introduction 

The clique-width of a graph is defined by composition mechanisms for vertex- 
labeled graphs, see [5] . The operations are the vertex disjoint union of labeled 
graphs, the addition of edges between vertices controlled by some label pair, 
and a relabeling of the vertices. The used number of labels corresponds to the 
clique-width of the defined graph. Clique-width bounded graphs are especially 
interesting from an algorithmic point of view. A lot of NP-complete graph pro- 
blems can be solved in polynomial time for graphs of bounded clique-width if 
the composition tree of the graphs is explicitly given. For example, the set of all 
graph properties which are expressible in monadic second order logic with quan- 
tifications over vertices and vertex sets (MSOi-logic) can be solved in linear 
time on clique-width bounded graphs [4]. The MSOi-logic has been extended by 
counting mechanisms which allow the expressibility of optimization problems, 
see [4]. All these problems expressible in the extended MSOi-logic can be sol- 
ved in polynomial time on clique-width bounded graphs. Furthermore, a lot of 
NP-complete graph problems which are not expressible in MSOi-logic or exten- 
ded MSOi-logic like Hamiltonicity and the simple max cut problem can also be 
solved in polynomial time on clique- width bounded graphs, see [14]. 

In [14] the notion of NLC-width is defined by a composition mechanism for 
vertex-labeled graphs which is similar to that for clique-width. Every graph of 
clique-width at most k has NLC-width at most k and every graph of NLC- 
width at most k has clique- width at most 2k, see [11]. The only essential diffe- 
rence between the composition mechanisms of clique-width bounded graphs and 
NLC-width bounded graphs is the addition of edges. In an NLC-composition 
the addition of edges is combined with the union operation. The union opera- 
tion applied to two graphs G and J is controlled by a set S of label pairs such 
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that for each pair (a, b) all vertices of G labeled by a will be connected with all 
vertices of J labeled by b. The tree structure of the NLC-expression is a very 
suitable representation for the efficient processing of the graph with respect to 
a graph property or an optimization problem. This shows especially the algo- 
rithmic framework introduced in [14], and the proof of the main theorem of this 
paper. 

The most famous class of graphs for which a lot of NP-complete graph pro- 
blems can be solved in polynomial time is the class of tree- width bounded graphs, 
see Bodlaender [2] for a survey. All graph properties expressible in monadic se- 
cond order logic with quantifications over vertex sets and edge sets (MS02-logic) 
can be solved in linear time for tree-width bounded graphs, see [6]. The MSO 2 - 
logic has also been extended by counting mechanisms to express optimization 
problems which can then be solved in polynomial time for tree-width bounded 
graphs, see [1]. 

It is already known that each graph of tree-width at most k has clique-width 
at most 2^+^ -I- 1, see [5], and NLC-width at most 2^+^ — 1, see [14]. However, the 
set of all graphs of clique-width at most 2 is equivalent to the set of all graphs 
of NLC-width 1, and equivalent to the set of all cographs. Since the set of all 
cographs contains all complete graphs, the set of all graphs of clique-width at 
most 2 and the set of all graphs of NLC-width 1 do not have bounded tree-width. 

In this paper, we proof that each graph of NLC-width k which does not 
contain the complete bipartite graph for some n > 1 as a subgraph has 

tree-width at most 3k{n— 1) — 1. This result immediately implies a lot of further 
characterizations of tree- width bounded graphs. For example a set L of clique- 
width bounded graphs has bounded tree- width if (1.) the graphs of L do not 
contain arbitrary large complete bipartite subgraphs (2.) the graphs of L are 
uniformly Lsparse, (3.) the set of all minors of the graphs of L is not the set 
of all graphs, (4.) the graphs of L are planar or have bounded degree, (5.) the 
set of all bipartite planar subgraphs, subgraphs, or minors of the graphs of L 
has bounded clique-width, (6.) L is closed under bipartite planar subgraphs, 
subgraphs, or minors. For case (1.), (2.), (3.), and (5.) the reverse direction 
holds also true. 

In [5] it is already shown that there is some function / such that a set of 
graphs of clique- width at most k which does not contain some Kn^n as a subgraph 
has tree- width at most f{n, k). However, Courcelle and Olariu proofed only the 
existence of such a function /. In this paper we explicity give a bound on the 
tree-width and additionally the corresponding tree-decomposition. 



2 Basic Definitions 

We work with finite undirected graphs G = {Vc,Eo), where Vq is a finite set 
of vertices and Eq C {{m,u} | G Vq, u ^ v} is a, set of edges. Graph 
J = (Vj, Ej) is a subgraph of G if Vj is a subset of Vq and Ej is a subset of Eq- 
J is an induced subgraph of G if additionally Ej = {{m, w} G Eq | m, u G Vj}. 
We say a set of graphs L is closed under taking subgraphs or induced subgraphs 
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if for every graph G G L all subgraphs or induced subgraphs of G, respectively, 
are in L. 

Next we recall the definitions of tree-width, clique-width, and NLC-width of 
a graph. To distinguish between the vertices of general graphs and trees, we call 
the vertices of the decomposition-trees or expression-trees nodes. 



Definition 1 (Tree- width, [13]). A tree decomposition of a graph G = 
(Vg: Eg) is a pair {X, T) where T = (Vr, Et) is a tree and X = {Xu \ u € Vt{ 
is a family of subsets Xu C Vq one for each node u of T such that 

1. AueVr^u = Vq. 

2. For every edge {rui, W 2 } G Eg, there is some node u € Vr such that wi G Xu 
and W 2 G 

3. For every vertex w G Vq the subgraph ofT induced by the nodes u G Vr with 
w G Xu is connected. 

The width of a tree decomposition (X = {Xu \ u G Vr}, T = (Vt,Et)) is 
maxugVr \Xu \ — 1. The tree-width of a graph G is the minimum tree-width of all 
tree decompositions ofG. 



Courcelle and Olariu define in [5] the notion of clique-width for labeled gra- 
phs. Let [k] :={!,... , k} be the set of all integers between 1 and k. A k-labeled 
graph G := (Vg, -S'G, labc) is a graph {Vg,Eg) whose vertices are labeled by 
some mapping labG ■ V ^ [k]. A labeled graph J = {Vj, Ej, labj) is a subgraph 
of G if Vj C Vg, Ej C Eg and labj('u) = labG(M) for all u G Vj. A labeled 
graph which consists of a single vertex labeled by t G [k] is denoted by •(. 



Definition 2 (Clique-width, [5]). Let k be some positive integer. The class 
GWk of labeled graphs is recursively defined as follows. 



1. The single vertex graph •( for some t G [k] is in CWk. 

2. Let G = (VG,EG,labG) G CWk and J = (Vj,Ej,labj) G CWk be two 
vertex disjoint labeled graphs. Then G © J := {V , E' , lab') G CWk defined 
by V := Vg U Vj, E' := Eg U Ej, and 



lab' {u) 



f labG(u) ifuGVG 
\ labjiu) if uGVj 



Mu G V'. 



3. Let i,j G [k] be two distinct integers and G = (Vg, Eg, labG) G CWk be a 
labeled graph then 

a) pi^j{G) := {Vg, EG,lab') G CWk defined by 

lab'iu) := V^GVg 

[j iflabG{u) = i 



and 

b) rii,j{G) := {Vg,E' , labG) G CWk defined by 



E' \= E A {{m, v} I lab{u) = i, lab{v) = j}. 
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The clique-width of a labeled graph G is the smallest integer k such that G € 

GWk. 

Wanke defines in [14] the notion of NLC-width^ of labeled graphs. 

Definition 3 (NLC-width, [14]). Let k be some positive integer. The class 
NLCk of labeled graphs is recursively defined as follows. 



1. The single vertex graph for some t € [k] is in NLCk. 

2. Let G = (VG,EG,laba) G NLCk and J = (Vj, Ej,labj) G NLCk be two 
vertex disjoint labeled graphs and S C [k]'^, then G Xs J ■= (V' , E' ,lab') G 
NLCk defined by V := Vg U Vj, 

E' \= EgGI Ej\J {{u,v} \u GVg, vGVj, {labG{u),labj{v)) G S}, 



and 



lab' {u) 



f labG(u) ifuGVG 
\ labj{u) if uGVj 



Mu G V. 



3. Let G = {Vg,Eg, Mg) G NLCk and R : [k] ^ [fc], then o/j(G) := {Vg,Eg, 
lab') G NLCk defined by lab'{u) := R{lab{u)), Mu G Vg. 



The NLC-width of a labeled graph G is the smallest integer k such that G G 

NLCk. 

An unlabeled graph G = (Vg,Eg) has clique-width k (NLC-width k) if 
there is some labeling labc : Vg -G [k] such that (Vg, LIg) labc) has clique-width 
k (NLC-width k, respectively). Since a relabeling of vertices does not change 
the clique-width or NLC-width of a graph, we can assume that the vertices in 
unlabeled graphs are all equally labeled. This allows us to use the notation graph 
without any confusion for labeled and unlabeled graphs. 

There is a close relation between the clique-width and the NLC-width of a 
graph as the next theorem states. 

Theorem 1 ([10], [11]). Every graph of clique-width k has NLC-width at most 
k and every graph of NLC-width k has clique-width at most 2k. 

The set of graphs of NLC-width at most k is closed under induced sub- 
graphs and edge complement, i.e., if G = (Vg, Ag, labG) G NLCfe then G = 
(VG,£^G,labG) G NLCfc for Eg = {{u,v} \_u,v gVg, m yf v, {u,?;} ^ Eg}, see 
[14]. If G has clique- width at most k then G has clique-width at most 2k, see [5]. 
The set of graphs of clique-width at most 2 and NLC-width 1 is exactly the set 
of all cographs, see [5,14]. Thus the tree-width of a graph is not bounded by its 
clique-width or NLC-width. Distance hereditary graphs have clique-width and 
NLC-width at most 3, see [9]. The clique- width and NLC-width of permutation 

^ The abbreviation NLC results from the node label eontrolled embedding mechanism 
originally defined for graph grammars. 
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graphs, interval graphs, grids and planar graphs is not bounded by some fixed 
integer k, see [9]. If a graph has tree- width k then it has clique- width at most 
2^+1 _|_ ggg | 5 j^ NLC-width at most 2^+^ — 1, see [14]. A graph with n 
vertices has clique-width at most n — r, if 2'’ < n — r, and NLC-width at most 
\^~\, see [11]. The recognition problem for graphs of clique- width at most k and 
graphs of NLC-width at most k is still open for fc > 4 and fc > 3, respectively. 
Clique-width of at most 2 and NLC-width of at most 1 are decidable linear time, 
see [8]. Clique- width of at most 3 and NLC-width of at most 2 are decidable in 
polynomial time, see [3,12]. 



3 The Main Theorem 

The expression-tree of an NLC-expression is an ordered rooted tree whose nodes 
are labeled by the operations of the expression and whose arcs are pointing to 
the roots of the expression-trees of the involved sub-expressions. 

Definition 4 (Expression-tree). The expression-tree T of •( consists of a 
single node r (the root of T) labeled by •t- The expression-tree T of or{G) 
consists of the expression-tree T' of G with an additional node r (the root ofT) 
labeled by or and an additional arc from r to the root ofT'. The expression-tree 
T of G Xs J consists of the disjoint union of the expression-trees Tq and Tj of 
G and J, respectively, with an additional node r (the root ofT) labeled by xr 
and two additional arcs from node r to the roots of Tq and Tj . The left son of 
r is the root ofTc, the right son of r is the root ofTj. 

Note that there is a one-to-one correspondence between the vertices of G 
and the leafs of the expression-tree Tq of an NLC-expression for G. Let Ta{u) 
for some node u of Tq be the subtree of Tq induced by node u and all nodes 
V for which there is directed path from m to w in Tq. The tree Tg{u) is an 
ordered rooted tree with root u. The expression of Tg{u) defines a (possibly) 
relabeled induced subgraph G{u) of G. The vertices of G{u) are the vertices of 
G corresponding to the leafs of the subtree Tg{u). The edges of G{u) are those 
edges of G for which both end vertices are in G{u). The labels of the vertices 
in G{u) are defined by the expression of Tg(u) which is a sub-expression of the 
expression of Tq ■ These labels are not necessarily the final labels of the vertices 
as in graph G, because the vertices of G{u) can be relabeled by the operations 
of the nodes on the path from the root of Tq to the father of u. 

Theorem 2. Let G be a graph of NLC-width k such that the complete bipartite 
graph Kn,n for some n > 1 is not a subgraph of G, then G has tree-width at most 
5k{n — 1) — 1. 

Proof. We define a tree-decomposition {X = {A„ | u &Vt}, T = {Vt,Et)) of 
width at most 3fc(n — 1) — 1 from a given expression-tree Tq = (Vtc,Etq) for 
some graph G = (Vg,Eg) of NLC-width k that does not contain the complete 
bipartite graph as a subgraph. 
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The tree T = {Vt,Et) of the tree-decomposition is defined from the 
expression-tree Tq for G as follows. 

1. The node set of T is the node set of Tq- 

2. There is an edge {u, f } in T if and only if there is an arc from m to u in Tq- 

To define the sets for the nodes u of T, we need some further notations. 
For a node u of Tq and a label t £ [k], let A{u, t) be the set of all vertices that 
appear in G{u) with label t. Let B{u, t) be the set of all vertices of G that are 
not in G{u) but which are adjacent in G to some vertex of A(u, t). Note that all 
vertices of A{u,t) are adjacent to all vertices of B{u,t), because equal labeled 
vertices of G{u) will be treated in the same way by all the operations of the 
nodes on the path from the root of Tq to the father of u. That is, either set 
A{u, t) or set B{u, t) or both sets have less than n vertices, otherwise G contains 
some Kn^n as a subgraph which contradicts our assumption. 

Now we define the sets C Vg for the nodes u of T depending on the label 
of u in Tg- 

1. If node u is labeled by *t, i.e, if u is a leaf in Tg- 

Then Xu is defined to be the set which consists of the single vertex of G 
corresponding to the leaf u of Tg- 

2. If node u is labeled by o^: 

Let V be the son of u in Tg- Then Xu is defined to be the union of all sets 
A{v,t) and B{u,t) for t £ [k] which contain less than n vertices. 

3. If node u is labeled by Xg: 

Let vi,V2 be the two sons of u in Tg- Then Xu is defined to be the union 
of all sets A{viA), ^{'^2^)1 and B{u,t) for t G [k] which contain less than n 
vertices. 

It remains to show that {X, T) is a tree-decomposition of width at most 
3fc(n — I) — 1. 

1. DueVr^u = Vg- 

Since each vertex of G corresponds to some leaf of Tg, all vertices of G are 
already contained in the union of all sets Xu for the leafs u of T. 

2. \Xu\ < 3k{n — 1) for each u G Vr- 

This follows from the fact that each either consists of a single vertex or 
is the union of at most 3k sets of size at most n — 1. 

3. For each edge {^1,^2} of G there is some node u £ Vt such that Wi,W2 G 

Let {ici, W2} be an edge of G and u be the node of Tg such that u has two 
sons vi and V2 such that wi is a vertex of G{vi) and W2 is a vertex of G{v2)- 
The connection between w\ and W2, i.e., the edge {wi,W2} £ Eg, is due to 
the X5 operation of node u- Let ti and t2 be the labels of and W2 in graph 
G(ui) and graph G{v2), respectively. If v\ is the left son and V2 the right 
son of u then (ti,t2) £ S- We also know that G A{vi,ti), W2 £ A(v2,t2), 
W2 G B{v\,t\), and rui G B{v2,t2) by the definitions of these sets. 
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a) If < n and |A(w2,i2)| < n then wi and W2 are both in 

because then A{vi,ti) C Xu and A{v2,t2) C Xu- 

b) If |A(tii,fi)| > n and \A{v2A2)\ > n then the is a subgraph of 

G{u), which contradicts our assumption. 

c) Suppose < n and \A{v2,t2)\ > n. The case for |A(r;i,ti)| > n 

and |^(w2,t2)| < n runs analogously. If \A{v2,t2)\ > n then |i?(r;2,t2)| < 
n and B{v2,t2) C thus wi € Xy^. Consider now the nodes in the 
expression-tree Tq on the path from node V2 to the leaf which corre- 
sponds to vertex W2 of G. Let ri be this leaf, be the father of 
for z > 1 , and r* be the node V2- Let U be the label of vertex W2 in 
graph G{ri) for z = 1 , . . . , s. Now we have A(ri, k) C A{ri+i, h+i) and 
B{n+i,li+i) C B{n,li) for z = 1 , . . . , s — 1 . Let j, 1 < j < s, be the 
least index such that \A{rj,lj)\ > n. Then Xy. is the set which con- 
tains both vertices Wi and W2- Since \A{rj_i,lj_i)\ < n, we know that 
A{rj-i,lj-i) C Xy^, and thus W2 € Xy^. Since \A{rj,lj)\ > n, we know 
that \B{rj,lj)\ < n, B{rj,lj) C Xy., and thus zci G Xy.. 

4 . For each vertex w G Vq, the subgraph of T induced by the nodes u with 
w G Xu is connected. 

Let w be any vertex of G and let r\ be the leaf of the expression-tree Tq 
corresponding to w. Let be the father of Vi for i = 1 , . . . ,h — 1 such 
that rh is the root of Tq- For all these nodes 1 < z < ft,, we know that 
zc is a vertex of G{ri) and thus w ^ B{n,t) for any t G [k]. Let ft be the 
label of w in graph G{ri) for z = 1 , . . . , ft. If zc G Xy. for some z > I then 
w G 2l(ri_i, ft_i). That is, the nodes rz with w G Xy^ induce a path the in 
decomposition-tree T. We call the nodes ri, . . . ,rh the back bone of vertex 
w in Tg- Note that vertex w is in subgraph G(zz) for some node zz of Tq if 
and only if zz is from the back bone of w in Tg- 

Let V be any node of Vtq — {ri, ... ,Th} such that w G Xy. Since v is not 
from the back bone, there has to be some label t G [ft] such that w £ B{v, t) 
and \B{v,t)\ < zz. That is, subgraph G{v) has at least one vertex w' labeled 
by t which is connected to w in G. Now we show that there is always a path 
in Tg from some node rz with |Gl(rz_i, ft_i)| < zz of the back bone of vertex 
w in Tg to node v. This shows that the subgraph of T induced by all nodes 
zz with w G Xu is connected. 

a) If the father of v is one of the nodes rz of the back bone with 
|v 4 (rz_i, ft_i)| < rz then nothing more is to show. 

b) If the father of v is one of the nodes of the back bone with 
|Gl(rz_i, ft_i)| > zz, then the subgraph G(rz_i) has at least zz vertices 
labeled by tz-i (the label of vertex w in G(rz_i)). Since equal labeled 
vertices are treated in the same way, all these vertices have to be connec- 
ted in G with vertices w' of G(z>). So A{n-i,ti-i) C B{v,t) and thus 
\B{v,t)\ > |A(rz_i, ft-i)| > zz, which contradicts our assumption that 
ji 3 (z;,t)j < zz. 

c) If the father v' of v is not from the back bone {ri, . . . ,rh} then let t' be 
the label of the vertices of G{v') which are labeled by t in G{v). Then 
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B{v',t') C B{v,t), \B{v',t')\ < \B{v,t)\ < n, and thus B{v',t') C Xyi. 
Since w has to be in B(v',t'), we have w G Xy/. This argumentation can 
be repeated up to the nodes of the back bone of Tq- 

□ 

Note that the bound of theorem 2 above is tight for k = 1 and n = 2. The 
complete graph with 3 vertices has NLC-width 1, does not contain the ^^ 2,2 as 
a subgraph, and has tree- width 3k {n — 1) — 1 = 2. 

A graph J = {Vj, Ej) is a minor of some graph G = {Vq, Eq) if J can be 
obtained from G by a sequence of edge deletions, edge contractions and deletions 
of vertices without incident edges. We say a set of graphs L is closed under taking 
minors if for every graph G G L all minors of G are in L. 

A graph G = {Vg,Eq) is l-sparse if \Eq\ < I ■ \Vg\- It is uniformly l-sparse 
if every subgraph of G is ^-sparse. A set of graphs is uniformly l-sparse if all its 
graphs are uniformly Z-sparse, see [7]. 

We say a set of graphs L has bounded tree-width (bounded clique-width, 
bounded NLC-width) if there is some k such that every graph G € L has tree- 
width at most k, (clique- width at most k, NLC-width at most k, respectively). 

Our main theorem immediately implies the following corollaries. 

Corollary 1. Let G be a graph of clique-width (NLC-width) at most k. 

1. If the complete bipartite graph Kn,n is not a subgraph of G, then G has 
tree-width at most 3k{n — 1) — 1. 

2. If G is uniformly l-sparse, then G has tree-width at most 6kl — 1. 

3. If there is a graph with n vertices which is not a minor of G, then G has 
tree-width at most 3k{n — 1) — 1. 

4-. If G is planar, then G has tree-width at most 6k — 1. 

5. If G has degree at most d, then G has tree-width at most 3kd — 1. 

Proof. 

1. By theorem 1 and theorem 2. 

2. If a graph G is uniformly Lsparse then the complete bipartite graph 
AT 2 i-i-i, 2 i-i-i is not a subgraph of G. 

3. If there is a graph with n vertices which is not a minor of G then the complete 
graph Kn is not a minor of G, and thus the is not a minor of G, and 
thus the Kn,n is not a subgraph of G. 

4. Planar graphs do not contain the ATa 3 as a subgraph. 

5. Graphs with vertex degree at most d do not contain the Kd+i^d+i as a 
subgraph. 

□ 

Corollary 2. Let L be a set of graphs of clique-width (NLC-width) at most k. 

1. If the set L' of all bipartite planar subgraphs, all subgraphs, or all minors 
of the graphs of L has clique-width at most I, then L has tree-width at most 

3fc(r^i-i)-i. 

2. If L is closed under taking bipartite planar subgraphs, subgraphs, or minors, 
then L has tree-width at most 3k{\^~\ — 1) — 1. 
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Proof. 

1. The I X ^-grid is bipartite, planar, has clique-width see [9] for the clique- 

width bound, and is a subgraph of ^ . That is, if L' has clique- width 

I then the I x I grid is not a subgraph of any graph of L and the ^ 

I 2 I ■ L 2 -I 

is not a subgraph of any graph of L. 

2. By (1.). 

□ 

The corollaries above can also be used to characterize sets of graphs of bo- 
unded tree-width as follows. Let L be a set of graphs of bounded clique-width. 
The set L has bounded tree-width if and only if 

1. every graph of L does not contain the complete bipartite graph as a 

subgraph for some n > 1, 

2. L is uniformly Lsparse for some I, 

3. there is some graph that is not a minor of every graph of L, or 

4. the set of all bipartite planar subgraphs, subgraphs or minors of the graphs 
of L has bounded clique- width. 

The if cases follow from corollary 1 (1.) (2.) (3.) and corollary 2 case (1.). The 
only if cases follow from the following observation. If a graph G has tree- width 
at most k then (1.) G does not contain the complete bipartite graph iLfc+i fe+i 
as a subgraph, (2.) G is uniformly fc-sparse, (3.) the complete graph iLfc +2 is not 
a minor of G, and (4.) each bipartite planar subgraph, subgraph, and minor of 
G has tree- width at most k and thus clique-width at most 2^+^ -I- 1. 
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Abstract. For any fixed parameter fc > 1, a tree k -spanner of a graph 
G is a spanning tree T in G such that the distance between every pair 
of vertices in T is at most k times their distance in G. In this paper, 
we generalize on this very restrictive concept, and introduce Steiner tree 
k -spanners: We are given an input graph consisting of terminals and 
Steiner vertices, and we are now looking for a tree fc-spanner that spans 
all terminals. 

The complexity status of deciding the existence of a Steiner tree fc- 
spanner is easy for some fc: it is AfV-hard for fc > 4, and it is in V for 
fc = 1. For the case fc = 2, we develop a model in terms of an equivalent 
tree covering problem, and use this to show A/”P-hardness. By showing 
the A/'T’-hardness also for the case fc = 3, the complexity results for all fc 
are complete. 

We also consider the problem of finding a smallest Steiner tree fc-spanner 
(if one exists at all). For any arbitrary fc > 2, we prove that we cannot 
hope to find efficiently a Steiner tree fc-spanner that is closer to the 
smallest one than within a logarithmic factor. We conclude by discussing 
some problems related to the model for the case fc = 2. 



1 Introduction 

Given a graph G, a tree k -spanner T of G is a spanning tree T, such that 
the distance between any two vertices in T is at most k times longer than the 
distance in G. The concept of fc:-spanners has been introduced by [13], and has 
been studied widely since (see, e.g., [2,6,11,12] and the references therein). 

The concept of tree fc-spanners turns out to be very restrictive in the sense 
that the class of graphs that admit a tree fc-spanner for a fixed fc is quite small. In 
some settings in the design of subnetworks, however, it is not strictly necessary to 
span all vertices or edges. In this paper, we introduce a generalized version of tree 
fc-spanners, called Steiner tree k-spanners, which models this situation: Similar 
as in the Minimum Steiner Tree Problem in unweighted graphs (cf. [10]), we 
are given an input graph consisting of terminals and Steiner vertices. We are 
now looking for a tree k-spanner that includes at least all terminals and spans 
the edges induced by them. 

Steiner spanners have appeared in the literature for example in [1], where 
the authors deal with absolute lower bounds on the number of edges that an 
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arbitrary Steiner fc-spanner (not necessarily being a tree) can have. Some authors 
have considered delay-hounded minimum Steiner trees, where one is interested 
in finding a Steiner tree T such that T fulfills some given distance constraints 
especially for the distances from a specified root (see for example [14,9]). But 
there an absolute, fixed delay bound is imposed on the distance of the vertices to 
the root, i.e. only to one particular vertex. To our knowledge, this is the first time 
where the generalization of tree fc-spanners in terms of Steiner tree /c-spanners 
is examined (see also [6]). 

1.1 Formal Definitions 

Basic notation. In what follows, G = (V, E) denotes an unweighted and undi- 
rected graph with finite vertex set V and finite edge set E. V{G) (resp. E{G)), 
denotes the vertex set (resp. edge set) of G. We do not allow loops or multiple 
edges. If i? is a subset of V, then G[i?j represents the subgraph of G that is 
induced by R. The distance between two vertices u and v in G, i.e., the num- 
ber of edges in a shortest path, is denoted by da{u,v). For a vertex v, denote 
by N{v) the set of all neighbors of v. A vertex v of a graph G is called universal 
w.r.t. V{G) if N{v) U {u} = V{G). Let v be universal w.r.t. V{G), then the star 
centered at v is the graph consisting of all vertices of G and all edges incident 
to V. For a connected graph, an articulation vertex is a vertex whose deletion 
disconnects the graph. A graph is biconnected if it has no articulation vertex. A 
block of a graph is a maximal biconnected subgraph. 

Let A be a polynomial-time algorithm that produces a feasible (though not 
necessarily optimal) solution for an optimization problem 77, and denote by A{I) 
the value of the solution that is achieved by A for the instance I. A is called 
a 6 -approximation algorithm (where 5 > 1) for 77 if for every instance 7 of 77, 

max I where OPT (7) is the value of an optimal solution 

for 7 in 77. (5 is called approximation ratio of A. A problem is inapproximahle 
(or hard to approximate) within some ratio 5 if it is A/”7^-hard to find a <5- 
approximation algorithm for this problem. See for example [3] for a survey on 
inapproximability results. 



Tree spanners. For any rational fc > 1, a spanning tree T is a tree k-spanner 
of a graph G = (V,E), if dT{u,v) < fc for all edges {u, u} G E\E{T). The 
parameter fc is called stretch factor. 

Observe that we only consider constant stretch factors, i.e., stretch factors 
that are independent of jPj and |7f|. We say that an edge e that does not belong 
to T is spanned (by T) if there exists a path of length at most fc (in T) connecting 
the end-vertices of e. Such a path is called a spanning path. 

Since distances in unweighted graphs are integral, it follows that T is a tree 
fc-spanner of G if and only if T is a [fcj -spanner of G. Thus, it suffices to consider 
integer stretch factors fc. If we look for a tree fc-spanner in a given graph G, it is 
clear that we only have to consider the biconnected components of G. Observe 
that there are many graphs that, for a fixed stretch factor fc, do not contain a 
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tree fc-spanner as a subgraph. We say that such a graph does not admit a tree k- 
spanner. For example, ^ does not admit a tree 2-spanner. The corresponding 
decision problem is called Tree /c-Spanner. 

Note that G may only admit a tree 1-spanner if G is a tree itself. Thus, Tree 
1-Spanner is in V. Furthermore, in [2] a linear algorithm for Tree 2-Spanner 
is given, whereas Tree fc-SPANNER is shown to be AfP-complete for all fc > 4. 

Steiner tree spanners. We now define the generalized version of tree k- 
spanners formally. In the following, the vertex set of the input graph G is par- 
titioned into two disjoint subsets, the set of terminals R and the set of Steiner 
vertices S. The edges of G[i?] are called terminal edges, all other edges are called 
Steiner edges. Denote by S{T), where T is a subgraph, the set of Steiner vertices 
in G used in T. 

Definition 1. Given a graph G = {R(j S,E) where G[R\ is connected, and 
an arbitrary k > 1, a subtree T of G is a Steiner tree /c-spanner of G if 
driuju) < k for every edge e = {m,u} of G[R\. 

In particular, a Steiner tree fc-spanner T must contain all vertices of R, and 
may include some of the vertices of S too. The edges of T may be a combination 
of both terminal and Steiner edges. Note that Steiner edges of G[V{T)] do not 
necessarily have to be spanned in T while terminal edges of G\V (T)] do. Observe 
that, by this definition, it is possible that the distance between two vertices in T 
may be shorter than their distance in G[R\. 

In contrast to tree fc-spanners, it is interesting to consider a decision problem 
as well as an optimization problem defined as follows: 

Problem 1. Steiner Tree /c-Spanner 

Given: G = (i? U S', if) where G[R] is connected. 

Problem: Does G admit a Steiner tree fc-spanner? 

Problem 2. Minimum Steiner Tree /c-Spanner 

Given: G = {Rid S,E) where G[R] is connected, K >0. 

Problem: Does G admit a Steiner tree fc-spanner T such that |S(T)| < K1 

In the latter, we are looking for a Steiner tree fc-spanner of the input graph 
that uses the smallest number of Steiner vertices, if one exists at all. Observe 
that the number |if(T)| of edges of a Steiner tree fc-spanner T is related to the 
number |S(T)| of Steiner vertices: \E{T) \ = |i?| -|- |S(T)| — 1. If we now measure 
the quality of T in terms of \E{T) \ instead of |S'(T) |, then a solution to Problem 2 
is also a minimum Steiner tree fc-spanner in this sense. 




Tree Spanners for Snbgraphs and Related Tree Covering Problems 209 



1.2 Results 

Since Steiner Tree fc-SPANNER is a generalized version of Tree fc-SPANNER, 
the AfP-completeness for A: > 4 is immediate. It remains to consider the cases 
where k is 1, 2, or 3. The case fc = 1 coincides with Tree 1-Spanner because 
no terminal edge can be spanned using some Steiner edges. Thus, it suffices to 
check whether G[i?] is a tree. 

It remains to discuss the cases A: = 2, 3. In Section 2, for k = 2, we develop a 
model in terms of a tree covering problem. Using this, in Section 3 we show that 
Steiner Tree 2-Spanner is AfP-complete, in contrast to Tree 2-Spanner. 
Observe that our model only copes with the case A: = 2; it is inappropriate for 
other stretch factors. But we can use the AfP-completeness for A: = 2 to show 
the AfP-completeness for k = 3. 

If we now assume that we know in advance that a graph G admits a Steiner 
tree A:-spanner for some fixed k, it makes sense to also study the optimiza- 
tion problem. Clearly, the AfP-hardness for this follows directly from the results 
above. In Section 4, we examine the approximability status: Minimum Steiner 
Tree A;-Spanner is hard to approximate within anything better than logarith- 
mic ratio. 

Finally, in Section 5, we give an outlook on further tree covering problems 
that are similar to the one that arises in the light of Steiner Tree 2-Spanner. 
These problems do not directly carry over to the context of Steiner tree k- 
spanners, but we consider them of independent interest. 

Due to space limitations, we do not give details of some proofs within this 
extended abstract but refer to the full version [8]. 

2 A Model for Steiner Tree 2— Spanners 

Instead of trying to solve Steiner Tree 2-Spanner directly, we first examine 
the underlying structure of Steiner tree 2-spanners in order to find a suitable 
model. When dealing with Steiner tree 2-spanners, we have the following situa- 
tion: a terminal edge can either be spanned by itself or by a path of length 2 
that consists either of two terminal edges or two Steiner edges. That means that 
exactly one Steiner vertex may be used for spanning a terminal edge unless it is 
spanned by itself or by two terminal edges. 

2.1 The Role of the Blocks 

Steiner tree 2— spanners in a block. As shown in [2], there is an efficient 
algorithm to decide whether or not a graph admits a tree 2-spanner. Among 
others, this algorithm uses the fact that the blocks may be treated separately. 
Also in the case of Steiner tree 2-spanners, the blocks of G[R] are important: 

Lemma 1. Let G = {R(j S,E), and let G[R] be a biconnected graph that does 
not admit a tree 2-spanner. Then a subgraph T of G is a Steiner tree 2-spanner 
of G if and only if there is a Steiner vertex s € S such that {s,r} € E(T) for 
all terminals r € R (i.e., s is a universal vertex w.r.t. R). 
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Note that this lemma heavily uses the stretch factor of 2; it does not hold for 
larger k. By the previous lemma, it is shown that Steiner Tree 2-Spanner 
can be solved polynomially when G[R] is biconnected. 

Now consider a graph G where G[R] is connected and consists of several 
blocks. Let T be a Steiner tree 2-spanner of G. Then, for every block there is at 
most one universal Steiner vertex in a T: 

Lemma 2. Let G be a graph where G[R] is connected. IfT is a Steiner tree 2- 
spanner of G then for every block H ofG[R], T[V{H)] is either a tree 2-spanner 
of H or there is exactly one Steiner vertex s € S such that {s,r} € E(T) for all 
r G V{H). 

In the following, we say that a Steiner vertex s spans a block H of G[i?] whenever 
s is universal for H and the star centered at s is a subgraph of the Steiner tree 
2-spanner. 

Unfortunately, the existence of a tree 2-spanner or universal Steiner vertex for 
every block is not sufficient. It does not suffice to examine every block separately 
and then simply combine the Steiner tree 2-spanners of every block (as it is the 
case for tree 2-spanners). For example, even if a block admits a tree 2-spanner, 
it may be inevitable to span this block by a Steiner vertex. See Figure 1(a) for an 
example. Figure 1(b) shows an example of a graph where every block has some 
universal Steiner vertex, but there is no Steiner tree 2-spanner for the whole 
graph. The situation in Figure 1(c) is different: r is an articulation vertex that 
is shared by four blocks. Here, the two stars centered at si and S 2 , respectively, 
form a Steiner tree 2-spanner. Thus, we have to take into account the whole 
block structure of G[R\. 

The block-graph. Observe that the blocks of a connected graph have a tree- 
like structure. We model this by a so-called ©-©-tree (see also [7]) consisting 
of two different types of nodes: a ©-node for every block, and a ©-node for 
every articulation vertex. We connect the nodes in a way such that a ©-node 
is connected to a ©-node by an edge, whenever the corresponding articulation 
vertex is contained in the respective block. For reasons of simplicity, we also 
create an additional ©-node for every block that contains only one articulation 
vertex. We refer to this graph as block-graph, denoted by B{G). 
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Altogether, we have the following straightforward properties: B{G) is a tree, 
and the leaves of B{G) are ©-nodes. Moreover, every ©-node (or ©Hnode, res- 
pectively) is adjacent only to ©Hnodes (or ©i-nodes, respectively). 

Certainly, there are also other straightforward methods to consider the block 
structure. But we will see later that we really have to distinguish between articu- 
lation vertices and real blocks. In the sequel, we abbreviate the notation B{G\B\) 
as B{G). Concerning Steiner tree 2-spanners, we get the following observation: 

Lemma 3. If two blocks Hi and H 3 of G[B\ are spanned in a Steiner tree 2- 
spanner T by the same universal Steiner vertex s then also all other blocks that 
lie on the path from Hi to H^ in B{G) are spanned by s in T. 



2.2 An Equivalent Tree Covering Problem 

Our goal is to model Steiner Tree 2-Spanner in terms of an equivalent 
problem that is easier to handle. For this, we need some more notation for the 
block-graph: A subtree of a©K 2 )-tree B is called ©-©-subtree. A ©)-@-subtree T 
of a ©H3)-tree B is called proper if the following holds: whenever a ©Pnode b 
belongs to T then also all ©Hnodes that are adjacent to 6 in B belong to T. The 
intuition here is as follows: whenever a block (corresponding to a ©-node) is 
selected, we also select all incident articulation vertices (corresponding to a ®- 
node). Therefore, in a proper ©-©Hsubtree, the neighborhood of every ©)-node 
is preserved. A collection of proper ©-©Hsubtrees that are node disjoint is called 
a proper ©-©-forest. 

Before starting with the model, we modify the given instance as follows: For 
every block that admits a tree 2-spanner, we create a new Steiner vertex (called 
a fake Steiner vertex, in contrast to real Steiner vertices) . Note that every block 
of G[R] that consists of only a single edge admits a tree 2-spanner and is thus 
treated here. It is easy to see that the new instance is equivalent to the original 
instance. In the sequel, we always use the extended set of Steiner vertices. 

We use the block-graph as a basis for our model. For every Steiner vertex s 
of G, we denote by F^^'> the subgraph of B{G) that is induced by the ©-nodes 
that correspond to blocks that may be spanned by s, together with their adja- 
cent ©(-nodes. Then, is a proper ©)-©)-forest of B{G), consisting of several 
maximal proper ©-©(-subtrees of B{G). Denote these proper ©-©-subtrees by 
. . . , and let T be the collection of the for all (real or fake) 

Steiner vertices s. 

Our goal is to define a tree covering problem in the following sense: Given a 
tree and a collection of subtrees thereof, find a cover of the tree, i.e., a collection 
of the subtrees such that each vertex of the tree is included in a subtree, subject 
to some further constraints. Here, we take B{G) as the underlying tree and T as 
the collection of subgraphs. In the sequel, we develop the constraints such that a 
covering selection of subtrees from T induces a Steiner tree 2-spanner for G and 
vice versa. For this, we translate the facts about Steiner tree 2-spanners that we 
have compiled above into the context of the (B{G),F) tree cover problem: 
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1. Lemma 3 indicates that all ‘intermediate’ blocks must be spanned whenever 
two blocks are to be spanned by one distinguished Steiner vertex. Stated the 
other way round: whenever a block Hi is spanned by a Steiner vertex si, and 
another block H 2 is spanned by a different Steiner vertex S 2 , then no other 
block H 3 such that H 2 lies on the path from Hi to H^ in the block-graph is 
spanned by Si. In the context of the tree cover problem, this means that we 
may choose at most one of for every s. 

2. If we select one particular Steiner vertex s, it is not necessary that we use s for 
all blocks that are potentially spanned by s. In other words, we may pick only 
some of these blocks, as long as these blocks induce a connected component 
within B(G). In the context of the tree cover problem, this corresponds to 
choosing subtrees of the trees contained in each 

3. By Lemma 2, each block has to be spanned by exactly one Steiner vertex. 
That means that each ©-node of B{G) has to be covered exclusively. Since 
articulation vertices may be touched repeatedly, ©hnodes may be covered 
more than once. We can achieve this by considering edge disjoint proper ©)- 
©(-subtrees: Two proper ©(-©(-subtrees of the same ©)-©)-tree that are edge 
disjoint never share a ©(-node. They may share at most one ©)-node. 

Using this, we can model Steiner Tree 2-Spanner as a problem of an edge 
disjoint cover of the ©)-©)-tree by proper ©(-©(-subtrees: 

Problem 3. Edge Disjoint ©(-©(-Subtree Cover 

Given: A ©(-©(-tree B and a set of ©(-©(-forests T = . . . , such 

that each consists of a set of node disjoint proper ©(-©(-subtrees 
of B. 

Problem: Find a cover of the nodes of B consisting of edge disjoint proper 

©(-©(-subtrees of a where every j is chosen at most once. 

Using the remarks above, we can prove that both problems are equivalent: 

Lemma 4. G admits a Steiner tree 2-spanner if and only if there is a solution 
to Edge Disjoint ©)-©(-Subtree Cover with input (B{G),F) as above. 

Furthermore, any instance of Edge Disjoint ©)-©(-Subtree Cover can be 
re-interpreted and re-constructed in terms of Steiner Tree 2-Spanner. 

3 Complexity of Finding Steiner Tree fc— Spanners 

Theorem 1 implies the AfF-completeness of Steiner Tree 2-Spanner: 

Theorem 1. Edge Disjoint ©(-©(-Subtree Cover is -complete. 

Proof. It is clear that Edge Disjoint ©(-©(-Subtree Cover is in MV. We 
show the A/”F-completeness of the problem by a reduction from the Domatig 
Number Problem: We are given a graph G = (V = {vi, . . . ,v\v\},E) and a 
positive integer K < \V\. The problem is to decide whether V can be partitioned 
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into £ > K disjoint sets Vi , V 2 , . . . , Vf such that every Vi is a dominating set for G 
(i.e., such that for every 1 < i < i, every vertex in V is adjacent to at least one 
vertex in Vi)7 Domatic Number is AfP-complete, and remains so for any fixed 
K > 3 (see [5]). W.l.o.g., we can restrict ourselves to the case £= K. 

Let us first review the problem: In a feasible solution Vi, V 2 , . . . , Vk for the 
instance G of Domatic Number, every vertex Vj appears in exactly one of 
the Vj. The Vj can be viewed as bins, and every Vj is put into exactly one of 
them. Moreover, the cumulated, closed neighborhood of all of the vertices in 
each of the bins is V] i.e., U [J V(wj) = V for alH € iV}. 

VjGVi 

Consequently, each of the bins ‘induces’ an isomorphic copy of V. 

In the reduction from Domatic Number to Edge Disjoint ©-©-Subtree 
Cover, the idea is as follows: Each isomorphic copy of V that is induced by one 
of the bins, say Vi, is modeled by an individual component Bi of the block-graph, 
which each contains one ©Hnode for every vertex of V . The adjacencies in G are 
reflected in T . Formally, given an instance of Domatic Number, we construct 
the instance of Edge Disjoint ©)-©)-Subtree Cover as follows: 

— The ©H 2 )-tree B consists of one central ©Hnode c and K isomorphic block- 
graph components Bi for 1 < i < K\ Create a ©)-node ai and connect it to 
\V\ new ©Hnodes 6^ for 1 < j < |E|; add a new ©Unode to every ©-node as 
a leaf. Finally, connect each to the central ©Hnode c. See Figure 2(a). 

~ T consists of 1 -I- \ V\ ©-©-forests . . . , as follows: 

— consists of just one proper ©H©-subtree induced by the nodes 
Qi and by c. 

— For 1 < j < ©I, consists of K disjoint (but isomorphic) proper©)- 
©)-subtrees © for 1 < i < iF, one in each block-graph component Bi. 

contains the ©-node ai and the ©-node bj if Vj is adjacent to ve in 
G, or if ^ = j. Add the corresponding ©-nodes together with all induced 
edges. See Figure 2(b) for an example. 
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The adjacencies of a vertex Vj in G are modeled by Each contains 
all ©-nodes 6^ that correspond to the closed neighborhood of Vj in G (i.e., 
{r)j}UA^(vj)). Observe that all are isomorphic and node disjoint for a fixed j, 
and that none contains the central ©-node c. Moreover, every contains only 
nodes from one block-graph component Bi. 

Thus, if or a proper ©-©(-subtree thereof is selected in a solution 
of {B,T), then this exactly mirrors the case where Vj is put into bin Vi in a 
solution of Domatic Number. In particular, ©(-node Uj mirrors the bin Vi: 
Vi contains a vertex vj if a, is covered by (or a subtree thereof). Since we 
are only allowed to choose at most one ©-©(-tree from every ©(-©(-forest 
this corresponds to the exclusive selection of vertices to bins. 

This construction gives a feasible instance of Edge Disjoint ©-©(-Subtree 
Cover, and it can be constructed in polynomial time. Furthermore, all ©- 
nodes are covered by at least one of the proper ©-©(-subtrees. Lemma 5 
shows the equivalence of the instances. 

Lemma 5. Let (B,F) be as eonstructed above from the instance G of Domatic 
Number. G has a solution of Domatic Number if and only if {B,T) has a 
feasible edge cover. 

A solution for the instance of Domatic Number can be computed efficiently 
from the solution of an instance of Edge Disjoint ©-©(-Subtree Cover and 
vice versa. □ 



Discussion of special cases. We now discuss how some special cases of Stei- 
ner Tree 2-Spanner can be solved polynomially. Observe that we have already 
shown that the problem is polynomially solvable if G[R\ is biconnected. Furt- 
hermore, if the number of Steiner vertices [S'! or the number of blocks of G[i?] 
is bounded by a constant, we can use dynamic programming or even exhaustive 
search to get polynomial algorithms. 

Now consider the case that the block-graph B{G) is a ©-©-star centered at 
a ©(-node. In this case, it is sufficient to check if every block can be spanned 
by at least one Steiner vertex. But, as we will see in the next subsection, even 
in this simple case, we cannot hope for finding a solution to the optimization 
problem (Minimum Steiner Tree 2-Spanner) that is anything better than 
a logarithmic factor of the optimal solution. 



Hardness for k—ti. We can use the hardness of Steiner Tree 2-Spanner 
to show the AfP-hardness of Steiner Tree 3-Spanner. 



Theorem 2. Steiner Tree 3-Spanner is fVV -complete. 
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4 Finding Minimnm Steiner Tree fc— Spanners 

We now turn to optimization problem Minimum Steiner Tree /c-Spanner. As 
a consequence of Theorem 1, Minimum Steiner Tree 2-Spanner is also MV- 
complete. Hence, even if we know in advance that the given instance admits some 
Steiner tree 2-spanner, we cannot hope for finding efficiently one that uses the 
minimum number of Steiner vertices. We now strengthen this result by proving 
an inapproximability result. We do this for the general case of arbitrary k > 2. 
In particular, we show that it is even AfP-hard to find a Steiner tree fc-spanner 
in which the number of Steiner vertices is guaranteed to be within anything 
better than a logarithmic factor of the number of Steiner vertices in an optimal 
solution. 

Since we now consider arbitrary k, in this section, we cannot use the equiva- 
lent formulation as an edge disjoint subtree covering problem. Instead, we show 
the result directly by proving that finding an (approximate) minimum Steiner 
tree /c-spanner is as hard as solving Minimum Hitting Set: Given a collection 
C of subsets of a finite set F, and a positive integer K < |F|, the problem is to 
decide whether there is a subset F' C F with |F'| < K such that F' contains at 
least one element from each subset in C. 

Using the inapproximability results of [4] for the equivalent Minimum Set 
Cover Problem, there is no ((1 — e) log |F|)-approximation algorithm for Mi- 
nimum Hitting Set for any e > 0, unless MV C Note 

that N'T means that MV does not have quasi-polynomial 

deterministic algorithms, a slightly weaker assumption than V yf MV . 

Theorem 3. For any fixed integer k >2, Minimum Steiner Tree fc-SPANNER 
cannot he approximated within ratio (1 — e) log [S'] for any e > 0, unless MV C 
T>T2:7W£:(n'°s'°8”). 

Proof. We prove the hardness of Minimum Steiner Tree /c-Spanner by sho- 
wing that finding an (approximate) minimum Steiner tree fc-spanner is as hard 
as solving Minimum Hitting Set. Starting from an instance (C, F) of Mini- 
mum Hitting Set where C = {Ci, . . . ,Cm} and F = {/i,...,/„}, the idea 
is as follows: A subset Ci corresponds to a block in G that does not contain a 
tree fc-spanner (but a Steiner tree /c-spanner), whereas the elements of F are 
modeled by Steiner vertices. We construct the graph G for Minimum Steiner 
Tree /c-Spanner as follows: 

— Create terminals and terminal edges: For every Ci G C for 1 < t < m, 

create a simple cycle Ri of length k + 2 consisting of vertices V{Ri) = 
{rf , r},. . . , Melt together all vertices rf to form one central vertex r. 

— Create Steiner vertices and edges: For every fe G F for 1 < £ < n, create 
a new Steiner vertex se- For every fi G Ci, connect si to r and to r\ for 
1 < J < fc + 1- 

We get the graph G = {R(j S,E) with Steiner vertices S = {si, . . . , s„} and 
terminals R = {r} U {r\ \ I <i < m, 1 < j < fc -b 1}. Observe that [S'! = |F|. 
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For every Cj € C, there is a block Ri in G, and the members of Ci represent 
Steiner vertices that may span Ri. Observe that G[R] is connected, that the 
blocks of G[R] are formed by the for 1 < i < m, and that the blocks Ri 
do not admit a tree fc-spanner. Moreover, every block Ri admits a Steiner tree 
/c-spanner. Lemma 6 completes the proof. 

Lemma 6. There is a hitting set of cardinality K for the instance (G,F) of 
Minimum Hitting Set, if and only if G as constructed above admits a Steiner 
tree k-spanner containing K Steiner vertices. 

□ 

Observe that the instance G of Minimum Steiner Tree /c-Spanner as con- 
structed above has a block-graph B{G) that is a ©-( 2 )-star centered at a @- 
node. As stated in Section 3 above, in this special case at least Steiner Tree 
2-Spanner is solvable efficiently. 

In the definition of Minimum Steiner Tree /c-Spanner, we have measured 
the quality of a Steiner tree fc-spanner T in terms of the number of Steiner 
vertices. Let us now reconsider the optimization problem with respect to the total 
number of edges. As mentioned above, for any Steiner tree fc-spanner T, \E{T) \ = 
|i?| + |>S'(r)| — 1, and hence a minimum Steiner tree fc-spanner with respect 
to the number of Steiner vertices is also a minimum Steiner tree fc-spanner 
with respect to the total number of edges. By this, the modified optimization 
problem clearly remains AfP-complete. But note that we cannot deduce similar 
results concerning the (in)approximability of this problem from our reduction 
from Minimum Hitting Set, because the approximation ratio may tend to 1. 

5 Related Tree Covering Problems 

Motivated by the subtree covering problem that has appeared in the context of 
Steiner Tree 2-Spanner, we now examine different variations of this kind of 
covering problems. Observe however, that these new problems cannot be trans- 
lated to the context of Steiner tree fc-spanners. We deal with them here since we 
consider them interesting in themselves. In particular, we consider two aspects: 

— Edge disjoint ®-( 2 )-tree Cover by (THD-trees: 

Instead of considering the possibility of choosing a proper subtree of a sel- 
ected ©-©-subtree, we are forced to take the whole selected ©(-©-subtree. 
Observe that Problem 3 is not a direct generalization of this problem. The 
choice of whole ©-©(-subtrees of the F^^'> instead of picking parts thereof 
significantly changes the objective. Hence, the approaches of Theorem 1 do 
not apply here. However, this variant also turns out to be hard. 

— Trees instead of ©(-©(-trees: 

In contrast to the problems considered so far, we now generalize to arbitrary 
trees, not ©)-©)-trees. This yields the corresponding problems Edge Dis- 
joint Subtree Cover and Edge Disjoint Tree Cover. For both pro- 
blems, the complexity status remains AfP-complete. 
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Abstract. A fundamental problem in distributed computing consists 
in tracking causal dependencies between relevant events occurring du- 
ring the computation, named observable events. Several methods have 
been proposed so far in order to track these dependencies on line. They 
require to propagate information among processes participating in the 
computation, by piggybacking additional control data to the computa- 
tion messages. All these methods have to face the problem of the size of 
piggybacked information that could become prohibitive. However, boun- 
ding the size of piggybacked information may lead to the irremediable 
loss of causal dependencies, if the set of observable events is not correctly 
chosen. The challenge is to determine the minimal size of piggybacked 
information, in function of a given set of observable events, allowing to 
track all causal dependencies. This paper provides an answer to this 
previously open problem. This answer is based on the construction of 
a weighted graph modelizing the given computation with its observable 
events. Although the minimal value can be known only when all the com- 
putation is known, it can be used off line to perform a posteriori analysis 
of a computation. 



1 Introduction 

Causality is fundamental to many situations occurring in distributed computati- 
ons, such as distributed debugging, crash recovery, concurrency measure, causal 
ordering in communication protocols, just to cite a few. As realistic distributed 
computations are asynchronous (because processes do not have access to a com- 
mon clock and communication delays are unpredictable), the notion of real-time 
is not pertinent. Reasoning based on the causal structure of a system provides a 
better level of abstraction [1 1] . The notion of causal dependency between events 
has been formally defined by Lamport [9], endowing the set of events of a dis- 
tributed computation with a partially ordered set structure E = {E, -^c) (also 
called the causal dependency graph). Since then, there has been a lot of work 
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aiming to design methods that track causal dependencies during the computa- 
tion. All these methods are based on timestamps associated with events and on 
the piggybacking of information on messages used to update the timestamps. 
Timestamps are integer vectors of size n, where n is the number of processes 
involved in the computation. 

In this paper, we are interested in observation methods which consider an 
additional observer process and a subset O of relevant events (called observa- 
ble events). The observer gets the value of each observable event’s timestamp 
and, from these values, builds an observed dependency graph tracking causal 
dependencies between observed events. With respect to a given distributed com- 
putation and a given set O, an observation method is consistent if it tracks 
only causal dependencies between events in O, and it is complete if it tracks 
all the causal dependencies between events in O. Several consistent observation 
methods have been proposed. 

The transitive dependency method, proposed independently by Fidge [4] and 
Mattern [10] is complete whatever the choice of O (when O is empty, it means 
that E can be constructed on line in a decentralized way, without the need of 
an observer). However, the size of information to be piggybacked is prohibitive, 
since it amounts to n integer values^. In the direct dependency method, proposed 
by Fowler and Zwaenepoel [5], the size of piggybacked information is reduced to 
one integer. But, if O is not correctly chosen, this method is not complete [8] (the 
original paper [5] implicitly assumes O = E, which ensures the completeness). 

The k-dependency method, introduced by [1] is a generalization of the direct 
dependency method, where the size of piggybacked information is bounded by k 
integer values (k can be any integer ranging from 1 to n) . But, except when k = n, 
its completeness requires some conditions on O, depending on the given value 
k. Yet, the only known condition on O, called NIVI and introduced in [6], refers 
to the case k = 1 and is only a sufficient condition. That is, no weaker sufficient 
condition has been stated when fc > 1 (except k = n, where no condition is 
required) and, moreover, no necessary condition has been found so far. 

At the operational level, two approaches have been proposed to ensure com- 
pleteness of observation methods. In the first one [8], the set O is dynamically 
adapted to the size k. More precisely, the NIVI condition is enforced by directing 
processes to define additional observable events when this condition is about to 
be violated. Clearly, since the NIVI condition is only sufficient and designed for 
the case A: = 1, it can be too strong and forces too many observable events. In the 
second approach, the size k is dynamically adapted to the given set O. This ap- 
proach is illustrated by the incremental transitive method [8] which dynamically 
reduces the size of piggybacked information by removing some redundancies. 
But this method does not guarantee that the size of piggybacked informations 
will not grow up to n integer values, depending on O. 

The aim of this paper is to provide a characterization of sets O ensuring 
completeness of observation methods with a bounded size k of piggybacked in- 

^ Following [3] this size is necessary if one requires complete methods whatever the 
set O. 
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formation. Equivalently, the main result is a characterization of the minimal 
value of k to be used in order to ensure completeness with respect to a given dis- 
tributed computation and a given set of observable events. This characterization 
essentially shows that, the more observable events in the computation, the lesser 
k. In some sense, there is a tradeoff between the amount of information that 
must be propagated within the computation (piggybacked information) and the 
number of events that must output the information towards the observer pro- 
cess. To our knowledge, such a characterization (and the compromise which it 
reveals) had never been put forward, and is helpful for a better understanding 
of causality tracking methods. 

The paper is made of three sections. Section 2 presents the computation mo- 
del and causality tracking methods. Section 3 is devoted to the characterization 
result, and Section 4 concludes the paper. 



2 Computation Model 

2.1 Distributed Computation 

A distributed computation is modeled by a finite set of n sequential processes 
{Pi, . . . ,P„} communicating solely by exchanging messages. Each ordered pair 
of processes is connected by a reliable directed logical channel. Transmission 
delays are unpredictable but finite. 

The execution of each process Pi produces a totally ordered set of events 
Ei. An event may be either internal or it may involve communication {send or 
receive event). We denote as li Q Ei, Si C Ei and Ri C Ei the set of internal, 
send and receive events produced by process Pi and as E the set of all the events, 
i.e. E = Uf^iEi. 

Following Lamport [9], we say that an event e locally precedes e! , denoted as 
e — >■; e', if e immediately precedes e' on the same process. Moreover, e — e' if e 
is the sending event of a message and e' is the corresponding receive event. Then 
we can see a distributed computation as a directed graph E = (P,— >-c), where 
-^c is the well known causality relation (or Lamport’s happened-before relation) 
defined as the transitive closure of the relation — >■; U It is well known that E 
is a strict finitary partial order. Two events e and e' are concurrent if -<{e -^c e!) 
and -i(e' — >-c e). 



Notations. Given a directed graph G = {X, — J-^) we will denote its transitive 
closure as G+ = (A, —>•+), its transitive reduction as G~ = (A, —>•“), and 

Vcc G A : (x) = {a;} U {y G X \ y —>■)!" a;} 

In particular, when considering a distributed computation E, we have E~^ = E, 
and the causal past of any e G E is Ic (e) = |e} U |e' G P | e' -^c e}. 
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2.2 Tracking Dependency between Observable Events 

To design efficient distributed algorithms one needs to track causal dependencies 
during the computation. Nevertheless, detecting causal relations or concurrency 
between all internal and communication events is not desirable, so analyzing 
a distributed computation requires to define precisely a set of relevant events, 
called observable events and denoted by O (similarly, Oi = OC\Ei). If I denotes 
the set of internal events of E, we assume O Q I, that is only internal events can 
be observable. Note that this is not a restriction, because if a communication 
event must be “observed” , it is equivalent to create an internal observable event 
immediately following the communication event. At the considered abstraction 
level, the distributed computation is characterized by the directed graph O = 
(0,->-o) defined as the subgraph of E spanned by O, i.e, for every e, e' G O, 
e —>o e! ^ e eb 

Distributed computations can be visualized using space-time diagrams (see 
Figure l.(a)). Without loss of generality, we do not represent non-observable 
internal events. Each observable event is depicted as a dot located on a process 
line and the x-th observable event generated on process Pi is denoted as Ci^x- 
Moreover, each message is represented by an interprocess arrow. Actually, a 
time diagram is a graphical representation of the transitive reduction 0~ . In 
fact, e — >-o eMf and only if there is a path starting at e and ending at eb 

Following [12], the causal past lo (e) of an observable event can be repre- 
sented by an n-dimensional vector of integers V (e) (where n is the number of 
processes), called transitive dependency vector and characterized by 

Vj : ne)[j] = |io(e)nO,| , (1) 

which implies [12] 

V{e)= max E(e'), (2) 

or, equivalently, if Pi is the process producing eb 

V(e')[i] < V(e)[i] e' e. (3) 

Methods used to track causal dependencies between observable events con- 
sist in associating n-dimensional vector timestamps with observable events and 
in piggybacking control informations upon outgoing messages used to update 
timestamps. Moreover, an additional process, called observer process, receives 
the timestamps associated with observable events. Its role is to reconstruct O, 
or, equivalently, to compute exact values V (e) for each observable event, just by 
analyzing the received timestamps. More precisely, the observer process receives 
pairs (e,T(e)) where events e are observable and T(e) is some information con- 
trol associated with e, approximating V (e). Using these information, it builds an 
observed dependency graph, denoted ODG and defined as follows. Its vertices are 
events e received by the observer, labeled with T(e). There is an edge from e to 
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e', denoted e -^odg iff ^(e)[*] < T{e')[i] {Pi being the process producing e). 
If the method is consistent, property (3) ensures e! -^odg e => e' — >-o e. Then, it 
has to backward explore the graph ODG in order to build the transitive closure 
ODG~^ [2,12] and computes, for every vertex e: 

Tobs{e) = max T{e'). 

If ODG^ is isomorphic to O then Tobs{e) = V{e). 

Definition 1. The coding of a set of observable events is complete when ODG^ 
is isomorphic to O, that is the coding allows the observer to detect the correct 
dependencies between all observable events. 

2.3 Examples of Dependency Tracking Methods 

To manage timestamps, each process Pi has a local n-dimensional integer vector 
Vi, initialized to zero, where Vi[j] = x means Cj^x causally precedes the current 
event of Pi. Different methods correspond to different sizes of piggybacking in- 
formations, ranging from 1 to n, which can be statically or dynamically defined. 
In the rest of the paper, a pair (j, Vi[j]) will be called an event identifier on Pi. 
In particular, the pair (z, Vi[i]) is called direct dependency. 

The k-dependency method introduced by [1] actually represents a general 
scheme for dependency tracking based on piggybacking of information whose 
size, represented by the parameter k, is statically defined. Let fc G {1, . . . ,n}, 
and Tfc(e) be the timestamp associated with the event e by piggybacking control 
informations of size k. Rules for updating local vectors and assigning timestamps 
are the following: 

R1 upon observation of an event e on Pp. Vi[i] := Vi[i] + 1 ; Tfe(e) := 

R2 upon sending a message m. Pi piggybacks on this one a list m.L containing 
the direct dependency and k — 1 event identifiers selected according to some 
strategy; 

R3 upon receiving a message m with the list m.L, Vi[j] := max{Vi[j],V[j]), 
Vj : {j,V[j]) G m.L. 

When k = n, the previous scheme reduces to the known transitive dependency 
method [4,10], and T„(e) is really the vector V{e). In this case, whatever the 
choice ofOQI, ODG is isomorphic to O. In particular, it is already transitively 
closed. On the contrary, when k < n, the ODG is not necessarily transitively 
closed, as depicted in Figure 1, and, more important, if the set of observable 
events O is not correctly chosen, ODG~^ is not necessarily isomorphic to O (see 
Figure 2): some transitive information can be irremediably lost if it was not 
previously recorded by an observable event, that is if it has not been previously 
sent to the observer process. So, the completeness can be impacted [8] . 

As an example. Figure 1(b) shows the ODG produced by the 1-dependency 
method depicted Figure 1(a). Each vertex is labeled with the corresponding 
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[010] [100] [001] [ 120 ] [022] [200] [023] [130] 

(a) (b) 



Fig. 1. A distributed computation using the 1-Dependency Vector method (a) and the 
related ODG (b). 



[ 100 ] [ 200 ] 





[010] [100] [001] [ 120 ] [ 012 ] [200] [013] [130] 

(a) (b) 

Fig. 2. A distribnted computation nsing 1-Dependency method (a) and the related 
“incomplete” ODG (b). 



timestamp Ti. Here, ODG~^ is isomorphic to O: if we consider the event 63 ^ 2 ) we 
have Tobs{e3,2) = maXe'eioDG{e 3 , 2 )'^i{(^') = 2, 2] = ^( 63 , 2 )- Figure 2 shows the 

same when the observed event 62,2 has been eliminated. In this case, ODG^ C O. 
In fact, Tobs{s- 3 , 2 ) = 2 )^kW) = [0, 1, 2] instead of the correct value 

^^( 63 , 2 ) = [1)1,2]. This means that the causal dependency eip — >-o 63^2 cannot 
be retrieved from the ODG. This is obvious by observing the figure 2: there is 
no path from eip to 63^2 in the ODG. 

In order to ensure the completeness of coding, two dual approaches have been 
proposed: one consists in “adapting” the set O to the given static value k, the 
other consists in dynamically adapting the value of k to the given set O. 

Adapting O. In [6,8], a sufficient condition on O ensuring the completeness of the 
1-dependency method has been stated. This condition, called NIVI, stipulates 
that after a receive event there must be an observable event before a send event 
(on the same process) occurs. At the operational level, processes have to define 
forced observable events when this condition is about to be violated. If the NIVI 
condition on O is satisfied (or forced), the fc-dependency method is complete, 
for any given fc > 1. In fact, even if in the local vector V) there are more than k 
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different dependency informations, piggybacking the direct dependency is suffi- 
cient to ensure the completeness. The other event identifiers have already been 
recorded in some observable event. They are redundant and their piggybacking 
only improves the performance of the method (in terms of graph availability). 

Adapting the size of piggybacked information. Jard and Jourdan [8] proposed the 
incremental transitive method, which is adaptive with respect to the choice of O. 
In this case the rules for updating local vectors Vi and assigning timestamps are 
the following: 

Rl’ upon observation of an event e on Pp Vi[i] := Vi[i] + 1; T{e) := V) ; 
Vj ^ i do Vi[j] := 0 (reset step) 

R2’ upon sending a message m to Pj, a list m.L is piggybacked towards Pj 
containing all pairs {j,Vi[j\) such that Vi[j] ^ 0; 

R3’ upon reception from Pj with the list m.L, Vj : (j, V[j]) € m.L do Vi[j] := 
max{Vi[j],V[j]), . 

With respect to the /c-dependency method, there are two main differences: first, 
the size of piggybacked information is variable (although it can grow up to 
n). Second, all local information is propagated while no observable event occurs. 
When such event e occurs, the information is recorded in the timestamp T(e) and 
sent to the observer process, so the propagation can stop (reset step). This allows 
a process to “forget” the information received before the last observable event. 
We remark that if the reset step is not performed, a process cannot distinguish 
the “new” information received since the last observable event from the “old” 
ones already transmitted to the observer. It can be argued that when the NIVI 
condition is not satisfied, the reset step is necessary. 

In this paper we are interested in finding the minimal size k of piggybacked 
information in order to guarantee the completeness of the coding with respect to 
a given distributed computation and a given set O. So, the methods considered 
must not introduce additional observable events and thus must execute a reset 
step when an event is observed on a process. The next section investigates this 
relationship. 



3 Characterizing the Minimal Size of Piggybacked 
Information 

3.1 Construction of a Weighted Covering Graph 

Given a distributed computation E and a given set O C / of observable events, 
the minimal value of k ensuring a complete coding of O when the size of piggy- 
backed information is limited to k will be characterized. This characterization 
is based on the construction of a weighted graph from which this value can be 
easily computed. 

The idea is based on the following observation. Let us consider a process 
Pi where a sequence of I consecutive receive events occurs, each message being 
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received from a different sender. If the first event following this sequence is a 
send event, it means that the sent message must propagate I causal dependency 
informations towards the observable event that will follow receive{m). If, on the 
contrary, the first event following this sequence is observable, then all causal 
dependency information brought by these I messages is transferred towards the 
observer, and consequently has not to be propagated on the messages that will 
be sent by Pi after the observable event. Thus, every message sent appears to 
be the messenger of a certain amount of information, depending on how many 
causal information is stored on the sending process. However, this view is too 
pessimistic, because it does not take into account the fact that some information 
could be transferred from an observable event to another via several alternative 
paths in the distributed computation. 

A careful analysis of this situation leads to the construction of a weighted 
graph on the set O, whose maximum edge weight is equal to the required minimal 
value of k. The construction of this graph is carried out in three steps: (1) a 
propagation graph, built on O and additional vertices modelizing the sequences 
of receive events followed by a send event, (2) a weighted covering graph, whose 
transitive closure is isomorphic to O, obtained by suppressing additional vertices 
from the propagation graph and computing weights accordingly, and (3) the 
transitive reduction of the weighted covering graph, whose aim is to remove 
redundant information transmission. 

Propagation Graph. Let PG = (V, ~^pg) defined as follows. V = OU P, where 
an O- vertex is generated by each observable event, and a P- vertex is generated 
every time that a non empty sequence of consecutive receive events is followed 
by a send event (on the same process). Given two vertices X and Y generated on 
the same process Pi, we say that X = pred{Y) (or equivalently Y = succ(X)) if 
the events generating X and Y occur in Pi in that order, and there is no vertex 
between X and Y. With every message m of the computation are associated 
two vertices m and m in V. m is the vertex corresponding to the last non send 
event that locally precedes send(m), and m is the vertex corresponding to the 
first non receive event that locally follows receive{m), that is m is an O- vertex 
if receive{m) is followed by an observable event, a P-vertex otherwise. There is 
an edge X -^pg Y iS X = pred{Y) or there exists a message m such that X =m 
and Y =m. O-vertices are labeled like their corresponding observable event, 
whereas Pi^x,h denotes the h-th P-vertex on process Pi, between O-vertices Ci^x 
and Ci^x+i- 

As an example, in Figure 3 is shown the propagation graph (b) associated 
with the distributed computation depicted in (a). O-vertices are depicted as 
rounded rectangles, whereas P-vertices are depicted as sharp rectangles. 

Reduction Procedure to Weighted Govering Graph. The weighted covering graph 
WCG = (0,->-iucg) is a partial representation of O where an edge from e to e' 
represents a path in the time diagram starting form e and ending to e' without 
observable event between e and e'. At the end of its construction, the weight 
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Fig. 3. A distributed computation (a) and its associated propagation graph (b). 



of each edge from e to e! will represent the minimum size of the piggybacked 
information which guarantees that the information on e is propagated towards 
^ without any observable events after leaving e. This graph is built from the 
propagation graph by iteratively removing its P-vertices and replacing the re- 
moved edges by weighted ones linking only O- vertices. Rules for building WCG 
are described as follows. Initially, WCG = PG with all weights equal to 1. For 
each P- vertex X whose all predecessors are O- vertices: 

— for each Y such that Y -^wcg X, 

if -'(I^ -^wcg succ(X)) 

then create an edge Y —>wcg succ(X) with w(Y, succ(X)) = w(Y,X), 
else w{Y,succ{X)) = min(w(F, succ(X)), A")). 

— for each Z such that X -^wcg Z, create an edge prec{X) -^wcg Z with 
w{prec{X), Z) = 1. 

— for each Y ^ prec{X), Z ^ succ{X) such that Y -^wcg X -^wcg Z, 

if “'(P -^wcg Z) 

then create an edge Y -^wcg Z with w{Y, Z) = \{y\y -^wcg 
else w{Y, Z) := mm{w{Y, Z), \{y\y -^^^g X}\. 

— remove X from WCG. 

The construction stops when there are no more P-vertices in WCG. It is easy 
to see that, by construction, WCG^ = O. 

In the figure 4 is depicted an execution of the reduction’s procedure, the 
propagation graph (a) being the input and the weighted covering graph (c) the 
output obtained after eliminating two P-vertices P 2 ,i,i and P 3 , 2 ,i, as shown in 
(b) and (c), respectively. 

Transitive Reduction of the Weighted Covering Graph. The Weighted Covering 
Graph obtained in the previous section can exhibit different paths between two 
observable events. Clearly, if a path is “doubled” by an edge, this indicates that 
the information transmitted along the edge is also transmitted by the path. So, 
removing the edge does not modify the possibilities of information transmission 
represented by the rest of WCG. This well-known procedure, called transitive 
reduction, can be carried out on W CG, to obtain the graph W CG~ . This graph 
allows to state the desired characterization. 
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ei,i ei,2 ^1.2 







Fig. 4. An example of execution of the reduction procedure (a,b,c) on the propagation 
graph associated with the distributed computation depicted in Figure 3. 



3.2 The Characterization 

Theorem 1 (Minimal piggybacking size). Let E = (E,^) be a distributed 
computation, and O be a set of observable events. The value k necessary and 
sufficient to be used in an observation method in order to obtain a complete 
coding of the sub-order O is the maximum weight in the transitive reduction 
WCG~ of the weighted covering graph, that is: 

Wmax = max{'u;(e, e) \ e -^~^g e'}- 

Proof. The proof is made of two parts: necessity and sufficiency. 

Necessity. We show that, if A: < Wmax then no coding can be complete. For that 
purpose, it is sufficient to exhibit a scenario where there exists an observable 
event e such that Tobs{e) yf V{e), where T(e) is the timestamp of e in this 
coding. The figure 5 describes a computation with observable events (a) and its 
associated graph WCG~ (b) built according to the previous procedures. From 
WGG~ follows that Wmax = 3. Now, let’s consider a coding allowing only k = 2 
event identifiers per message. Figure 6 shows the values piggybacked on messages 
(a), the resulting timestamps and the ODG graph (b). From this follows that 
Tobs{s4,2) = [Oj 151,2] 1^(64, 2) = [1,1, 1,2]. Clearly, the causal dependency 

ei^i -^o 64,2 cannot be retrieved. Remark that another choice of vector entries 
to piggyback on the message m (in bold on Figure 6-a) would have led to the 
loss of causal dependency 62,1 — fo 64,2 instead. 

Sufficiency. We show that, if fc > Wmax, it is possible to obtain a complete 
coding. To this purpose, we define a protocol that ensures a complete coding 
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(a) (b) (omitted weights are equal to 1) 

Fig. 5. A computation and its associated WCG . 




Fig. 6. Loss of causal dependencies. 



under this assumption. This protocol is based on the fc-dependency method with 
a reset step. It is based on the following rules (see rules Rl’ to R3’ in Section 
2.3): 

Rl” upon observation of an event e on Vi[i] := Vi[i] + 1; T{e) := Vi; Vj yf 
* • ^i[j] •= 0 ; record (e,T(e)) on the observer (reset step) 

R2” upon sending a message m, Pi piggybacks on this one a list m.L containing 
the pair {i,Vi[i]) and at most k — 1 pairs (j, Vi[j\) such that Vi[j] yf 0; 

R3” upon receiving a message with the list m.L, for each j such that (j, V\f\) G 
m.L, Vi[j] := max{Vi[j],V[j]). 

This protocol provides a complete coding. Due to limitations, the detail of this 
proof is omitted. The complete proof can be found in [7]. It consists in proving 
by induction on the poset O that if fc > Wmax, then Ve G O : Tobs(e) = ^(e). □ 

As an example, Figure 4.(d), representing the transitive reduction of the weighted 
covering graph (c), shows that Wmax = 1 in such a case. Previous property affirms 
that it is necessary and sufficient to piggyback upon messages only one integer 
in order to ensure the completeness of the coding of O. 
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4 Conclusion 

Graph modelization is a powerful tool for analyzing properties of distributed 
computations. This technique has been used in this paper in order to determine 
the minimal size of piggybacked information ensuring the completeness of cau- 
sal dependency tracking between a given set of observable events. This requires 
to associate a graph with the computation and to perform operations trans- 
forming this graph into an transitively reduced weighted graph (WCG). This 
transformation is similar to a transitive closure followed by a transitive reduc- 
tion. This contribution is important, because, to our knowledge, no completeness 
characterization had been presented before. Such a characterization provides a 
better understanding of causality tracking techniques. Although the weighted 
graph WCG can be completely determined only when all the computation is 
known, it can be used off line to perform a posteriori analysis of a computation. 
A future work would be to find completeness conditions, less restrictive than 
the previously known NIVI condition, that could be tested on line. This result 
constitutes a theoretical basis for such an investigation. 

References 

1. R. Baldoni, M. Mechelli and G. Melideo. A General Scheme for Dependency 
Tracking in Distributed Gomputations, Technical Report no. 17-99, Dipartimento 
di Informatica e Sistemistica, Roma, 1999. 

2. P. Baldy, H. Dicky, R. Medina, M. Morvan and J.-M. Vilarem. Efficient Reconstruc- 
tion of the Causal Relationship in Distributed Systems. Proc. 1st Canada- France 
Conference on Parallel and Distributed Computing, LNCS #805, pp. 101-113, 1994. 

3. B. Charron-Bost. Concerning the size of logical clocks in distributed systems. In- 
formation Processing Letters, 39, 11-16, 1991. 

4. G. Fidge. Timestamps in message passing system that preserve the partial ordering, 
Proc. 11th Australian Computer Science Conf., 55-66, 1988. 

5. J. Fowler and W. Zwanepoel. Causal Distributed Breakpoints, Proc. 10th IEEE 
Int. Conf. on Distributed Computing Systems, pp. 134-141, 1990. 

6. E. Fromentin, C. Jard, G.-V. Jourdan and M. Raynal. On-the-fly Analysis of Dis- 
tributed Computations, Information Processing Letters, 54:267-274, 1995. 

7. J.M. Helary and G. Melideo. Minimal Size of Piggybacked Informa- 
tion for Tracking Causality: a Graph-Based Characterization, Rese- 

arch Report # 1300, IRISA, Universite de Rennes 1, February 2000. 
http : //www. irisa.fr/EXTERNE/bibli/pi/ 1300/ 1300.html. 

8. C. Jard and G.V. Jourdan. Incremental Transitive Dependency Tracking in distri- 
buted computations. Parallel Processing Letters 63, 427-435, 1996. 

9. L. Lamport. Time, clocks, and the ordering of events in a distributed system. 
Communications of the ACM, 217, 558-564, 1978. 

10. F. Mattern. Virtual time and global states of distributed systems, M. Cosnard and 
P. Quinton eds., Parallel and Distributed Algorithms 215-226, 1988. 

11. P. Panangaden, K. Taylor. Concurrent Common Knowledge: Defining Agreement 
for Asynchronous Systems, Distributed Computing 6(2), 73-93, 1992. 

12. R. Schwarz and F. Mattern. Detecting causal relations in distributed computations: 
in search of the holy grail, Distributed Computing 7(3), 149-174, 1994. 




The Expressive Power and Complexity of 
Dynamic Process Graphs 



Andreas Jakoby^’^, Maciej Liskiewicz^’^, and Riidiger Reischuk^ 



^ Institut fiir Theoretische Informatik, Medizinische Universitat zu Liibeck, 
Germany. { jakoby , liskiewi ,reischuk}@inf ormatik.mu-luebeck. de 
^ Department of Computer Science, University of Toronto, Canada. 
jakobyScs .toronto . edu 

® Instytut Informatyki, Uniwersytet Wroclawski, Poland. 



Abstract. A model for parallel and distributed programs, the dynamic 
process graph, is investigated under graph-theoretic and complexity as- 
pects. Such graphs are capable of representing all possible executions of 
a parallel or distributed program in a very compact way. The size of this 
representation is small - in many cases only logarithmic with respect to 
the size of any execution of the program. An important feature of this 
model is that the encoded executions are directed acyclic graphs with a 
regular structure, which is typical of parallel programs, and that it em- 
beds constructors for parallel programs, synchronization mechanisms as 
well as conditional branches. 

In a previous paper we have analysed the expressive power of the general 
model and various restrictions. Furthermore, from an algorithmic point 
of view it is important to decide whether a given dynamic process graph 
can be executed correctly and to estimate the minimal deadline given 
enough parallelism. Our model takes into account communication delays 
between processors when exchanging data. 

In this paper we study a variant with output restriction. It is appropriate 
in many situations, but its expressive power has not been known exac- 
tly. First, we investigate structural properties of the executions of such 
dynamic process graph s 0. A natural graph-theoretic conjecture that 
executions must always split into components isomorphic to subgraphs 
of Q turns out to be wrong. We are able to establish a weaker property. 
This implies a quadratic bound on the maximal deadline in contrast to 
the general case, where the execution time may be exponential. Howe- 
ver, we show that the problem to determine the minimal deadline is still 
intractable, namely this problem is MEXPTIME-compAete as is the 
general case. The lower bound is obtained by showing that this kind of 
dynamic process graph s can represent certain Boolean formulas in a 
highly succint way. 



1 Introduction 

Large parallel or distributed computations tend to have a lot of regularities. For 
example, the same instruction sequence may be executed by many processors in 
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parallel. To describe the elementary steps and the logical dependencies among 
them one can use graphs, often called data flow graphs. One would like to keep 
this description as compact as possible, for example not to unfold parallelism if 
this is not necessary. For this purpose, we have introduced the model of dynamic 
process graphs in [3]. For a more detailed motivation the reader may check there. 

The novel property of this graph model are different modes for the input 
and output behaviour that can be assigned to each node, which models a pro- 
cess of the computation. This allows us to model basic primitives for writing 
parallel programs, like fork and join. The two modes are called ALT and PAR. 
To represent such parallel programs in a natural way, we introduce dynamic 
process graphs, which are generalizations of standard precedence graphs. A dy- 
namic process graph is an acyclic graph G = (V,E) with two sets of labels 
l{v),0{v) G {PAR, ALT} attached to the nodes v GV. Nodes represent tasks, 
edges dependencies between tasks. A complete formal definition will be given in 
the next section. The label I{v) describes the input mode of task v. If I{v) = ALT 
then to execute v at least one of the direct predecessor tasks has to be comple- 
ted. I{v) = PAR requires that executions of all direct predecessors of v have to 
be completed before v can start. If task v gets executed then according to the 
output mode 0(v) one of its direct successors in case 0(v) = ALT (resp. all of 
them in case 0(v) = PAR) has to be executed as well. 

If one restricts the mode of nodes to PAR only, both for input and output, 
then this will be equivalent to ordinary data flow graphs. Using both modes, 
however, the representation of parallel programs by dynamic process graphs 
can provide an exponential compaction compared to the length of the actual 
execution sequences. Given a dynamic process graph, the first question that 
arises is whether it describes a legal parallel program. If yes then one would like 
to find an efficient execution of the program specified in such a compact form. 
We assume here that enough parallelism is available, so that the question turns 
into the problem to execute the program as fast as possible. 

Dynamic process graphs and Boolean circuits are somehow related. We have 
shown that such graphs can be used to model computations of a circuit. This 
has then been used to prove that indeed dynamic process graphs that can use 
arbitrary combinations of modes provide a very compact nontrivial representa- 
tion. To find an optimal schedule, which is AfP-complete for ordinary graphs, 
turns out to be Af£AtPEZA4£-complete. A similar complexity jump has been 
observed for classical graph problems in [1,5,4]. These papers have shown that 
simple graph properties become AfP-complete when the graph is represented 
in a particular succinct form using generating circuits or a hierarchical decom- 
position. Under the same representation graph properties that are ordinarily 
A/'P-complete, like HAMILTON CYCLE, 3-COLORABILITY, CLIQUE (with 
half the graph size), etc., become Af£AiVTXA4£-complete. 

If we put restrictions on the modes of a dynamic process graph its execution 
becomes easier. We have given a precise classification in [3] for all possible com- 
binations, except the most interesting case where the output mode is restricted 
to PAR, but the input mode may use both alternatives. Such graphs are still 
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able to model, for example, the two natural ways in which objects of an object- 
oriented languages can be activated: the total case, where all input parameters 
have to be specified before activation, and the partial case, where an object fires 
for any specified parameter once. 

For dynamic process graphs with PAR output mode we could show that 
finding an optimal schedule is at least yB'H 2 ~hard. There was no matching upper 
bound, only the NEXVTXME upper bound of the unrestricted case. Other 
combinations turn out to be less powerful. Restricting the input mode to ALT 
the problem becomes P-complete, and also fixing the output mode the problem 
gets A/’£-complete. 

As main results of this paper we show the following two complexity bounds. A 
linear upper bound on the length of executing dynamic process graphs with PAR 
output mode, whereas in the unrestricted case this bound may be exponential. 
Thus, restricting the expressive power of this graph representation decreases 
the time complexity of parallel and distributed programs drastically. On the 
other hand, we prove that computing an optimal execution in a setting that 
takes communication delays into account remains hard - this problem is still 
M£ T VT TM. f-complete . 

In the next section, we will give a formal definition of this graph model. In 
section 3 we exhibit structural properties of executing dynamic process graphs 
with PAR output mode restriction, which will imply the linear upper bound. 
Section 4 considers the scheduling problem and gives a general outline how the 
AfiPTPTTAdf -hardness result can be shown. Due to space limitations, a bunch 
of technical details have to be omitted. 

2 Dynamic Process Graphs and Runs 

Given a DAG G = (V,E) with node set V and edge set E, iox v & V let 
pred(t)):= {ui, . . . , Up\ denote the set of its direct predecessors, and succ(v):= 
{ici, . . . ,Wr} its direct successors. pred*(t)) is the set of all anchestors of v. 

Definition 1. A dynamic process graph, DPG for short, Q = {V,E,I,0) 
consists of a DAG (directed acyclic graph) with node set V and edges E and 
two node labellings 1,0 : V ^ {ALT, PAR}. V = {vi,V 2 ,--- ,Vn} represents a 
set of processes and E dependencies among them. I and O describe input, 
(resp. output} modes of the Vi. 

A finite DAG Elg = {W,E) is a run of Q iff the following conditions are 
fulfilled: 

1. The set W is partitioned into subsets W{vi) U W{v 2 ) U . . . U W(vn). The 
nodes in W{vi) are execution instances of the process Vi, and will be called 

tasks . 

2. Each source node ofG, which represents a starting operation of the program 
modelled by Q, has exactly one execution instance in Hg. 

3. For every v € V with pred{v) = (rti,... ,Up\ and succ(v) = {wi,... ,Wr} 
and every execution instance x &W{v) it holds: 
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- if I{v) = ALT then X has a unique predecessor y € IV(ui) for some i < p; 

- if I{v) = PAR then pred{x) = {y\, . . . , y^} with yi € W (ui) for each i < p; 

- i/0(v) = ALT then X has a unique successor z € W{wj) for some j < r; 

- ifO{v) = PAR then succ(x) = {zi, . . . , Zr} with zj € W{wj) for each j < r. 

We call a DPG G executable iff there exists a run for it. 

Given G with run Hg = (W, F), for each edge {u, v) in G we define F{u, v) := 
{{y,z) G F \ y G W{u) and z G W(y)}. 

Throughout the paper we will illustrate the ALT-mode by a white box and the 
PAR-mode by a black box. To represent a node of a DPG with input and output 
mode e.g. ALT, resp. PAR we will use two boxes: white at the top representing 
the input mode and the black box at bottom for output mode. For a source or 
a node with indegree 1 the input mode obviously is inessential. Hence we will 
ignore such a label, and similarly the output label in case of a sink or a node 
with outdegree 1. 

Fig. 1 gives an example of a DPG and two runs of it. DPGs can be used 
to specify parallel programs in a compact way. A run corresponds to an actual 
execution of the program. Observe that a run can be smaller than its defining 
DPG (see [3] Fig. 6). More typically, however, a run will be larger than the 
DPG itself since the PAR-constructor allows process duplications. The runs in 
Fig 1 illustrate this property. The following lemma gives an upper bound on the 
blow-up, resp. the possible compaction ratio of DPGs. 

Lemma 1 ([3]). Let G = {V,E,I,0) be a DPG and Fig = (W,F) he a corre- 
sponding run. Then \W\ < and this general upper bound is best possible. 

Gertain DPGs thus have processes with exponential many execution instances. 

The motivating question for our study of DPGs is how efficiently a compactly 
specified parallel program II can he executed. Note that this is a crucial problem 
during compile-time. Furthermore, we like to construct an optimal schedule for 
the execution tasks of II (compare e.g. [2]). Lemma 1 implies an upper bound 
for the execution time of such a compactly presented program. However, this 
bound may be much too pessimistic for massive parallelism. With respect to 
their size, DPGs may require exponential many execution instances for some of 
its processes. However, if enough processors are available it may be possible to 
construct runs that use only linear time by executing many tasks in parallel. 

Definition 2. Let G = (V,E,I,0) be a DPG and H = (W,F) be a run of G. 
A subrun 7?.(y) of H is a subgraph that is induced by one of its sink nodes 
y, i.e. it consists of a sink y and all nodes in pred*(?/) together with all their 
connections in F. We call a suhrun 'R-{y) of H fe-overlapping iff for every node 
V G V it holds |7^(j/) n W{v)\ < k I. A run H is k-overlapping if all its 
subruns are k-overlapping. We call H non-overlapping if it is 0- overlapping. 

Note that non-overlapping is a slightly stronger condition than the require- 
ment that each subrun has to be isomorphic to a subgraph of G. The isomorphism 
requirement seems to be a very natural condition. Fig. 1, however, shows that 
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it does not have to hold, even for DPGs with output restriction PAR. The two 
execution instances of the sink w induce two different subruns. For the run illu- 
strated in (b) each subrun is non-overlapping, it is isomorphic to a subgraph of 
G- In (c) each subrun contains both execution instances of process t, that means 
it is 1-overlapping. 




Fig. 1. (a) A DPG with 2 nonisomorpbic runs: process t has 2 execution instances, and 
the same holds for all its successors; (b) a non-overlapping run, (c) a run with overlap 
at W(t). Recall, that in a schematic representation of node v of a DPG a white box 
pictures the ALT -mode and a black box - the PAR-mode. 



The maximal size of a subrun of H gives a better upper time bound for 
executing H than just the size of H because all subruns can be executed inde- 
pendently in parallel. Moreover, if a DPG G has a non-overlapping run then its 
processes can be executed in linear time with respect to the size of G- 

Next, we give a formal definition for scheduling DPGs. Let ^ be a parameter 
describing the communication delay of the system. For the lower bounds it suffi- 
ces to treat <5 as a constant independent of the processes between which data is 
exchanged. Furthermore, it suffices to assume unit-execution time for all proces- 
ses. Our algorithms achieving matching upper bounds can even handle variable 
delays and execution times. 

To schedule a task graph H = {W, F) with delay <5 we assume that an unbo- 
unded number of processors can be used. In each unit-time interval a processor 
P can execute one single task w. In order to schedule w at time t each direct 
predecessor of w must have already been executed - either by P itself in previous 
time intervals or by some other processor by time interval t — 1 — 6 such that 
the result of this predecessor can arrive at P on time. Scheduling tasks graphs 
in the presence of communication delays has first been considered in [6]. 

Definition 3. A schedule S for a DPG G with delay S is a schedule of a run 
FI — {W,F). Let T{S) denote the duration of S, i.e. the point of time when S 
has executed all tasks, and Topt{Q,S) := 

schedule for (Q,S) * 
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For lower bounds on the complexity of this problem, instead of the optimiza- 
tion version to compute Topt{G, S) it will suffice to consider the decision problem. 



Definition 4 (DYNAMIC PROCESS GRAPH SCHEDULE (DPGS)). 

Given a DPG G with communication delay S and a deadline T* , does Topt{G, S) < 
T* hold? 

3 The Structure of Runs 

Lemma 1 has shown that the size of a run can be exponential with respect 
to the size of its underlying DPG. But this does not imply that an optimal 
parallel execution time of a run requires exponential length, too, because each 
of its subruns can be small. In this section we investigate the problem what is 
the minimal-overlapping run for a given DPG G- A simple thought shows that 
ii G = (V,E,I,0) possesses a non-overlapping run then independently of the 
communication delay one can achieve the bound Topt{G, < \V\. 

Lemma 2. Let G he a DPG with input mode restriction, either I = ALT, or 
I = PAR. Then every run of G is non- overlapping. 

The proof of this claim is easy: In case / = ALT each subrun is a simple 
path from a sink back to a single source. If / = PAR then one can show by a 
topological induction starting with the sources that no process can have more 
than one execution instance. 

The non-overlapping property does not hold anymore when the input mode 
may vary - even if one restricts the output mode. In case of O = ALT it is easy 
to construct a DPG G such that each run of G is not non-overlapping. However, 
we can show: 

Lemma 3. Let G = (V,E,L,0) be a DPG with k sources and O = ALT. Then 
G has a (k — 1) -overlapping run. 

This claim can be shown by a flow- argument. The requirement for the sources to 
have exactly one execution instance generates an initial flow of size k from the 
sources to their direct successors. The ALT output mode implies that the flow 
does not increase. On the contrary, an internal node v with input mode PAR will 
decrease the flow by |pred(w)| — 1. The flow leaving a node corresponds exactly 
to the number of execution instances this process requires. 

Thus, in case of ALT output mode runs have a quadratic increase at most. 
In the unrestricted case, this property does not hold anymore as the following 
result shows. 



Lemma 4. There exists a family G\,Gi, Ga, ■ • • of DPGs Gk = (hfc, Ek, Ik, Ok) 
with \ Vk \ =2k-\-l such that every run Hk of Gk has a unique subrun TZ - which 
is ( 2 (Afcl-i)/ 2 -i _ -overlapping. 
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The DPGs are constructed as follows. Qk consists of 2fc+l nodes Vk := { Vi | 1 < 
i < 2k + 1 } such that the first k+l nodes form a complete DAG as well as the 
last k + l nodes: 

Ek ■= { {vi, Vj)\l<i<j<k + 1} U { (vi, Vj) I fc + 1 < z < j < 2fc + 1 }. 

Furthermore, define the modes as follows: I{vi) is ALT for z < k+l and PAR for 
z > A: + 1. Moreover 0{vi) is PAR for z < fc + 1 and ALT otherwise. Now, if the 
delay 5k for Qk is chosen large enough - 5k = \Hk\ = i ■ 2^~^ suffices - then it 
does not pay to utilize more than 1 processor, since Hk has only one sink. 

Corollary 1. For the family Gi,G 2 ,--- defined above it holds Topt{Gk,5) > 
3 . if the communication delay is sufficiently large. 

Now, let us investigate the most interesting restricted case, namely DPGs with 
O = PAR, in the following called PAR-output DPGs. As one can see in 
Fig. 1 such DPGs may have overlapping as well as non-overlapping runs. Hence, 
a natural question is whether for any executable PAR-output DPG G one can 
always find a non-overlapping run. The example in Fig. 2 gives a negative answer 
to this question. However, the situation cannot be too bad, we can show an 
exponential improvement of the trivial upper bound. 




Fig. 2. An executable PAR-output DPG (left) with no non-overlapping run. 



Theorem 1. Every executable PAR-output DPG G = (,V,E,I,0) has a k- 
overlapping run where k := [log 2 max„gy |PF(z;)|J. 

A /c-overlapping run can be executed in parallel: for every subrun a unique 
processor completely processes all of its execution instances. This requires at 
most \V\ ■ {k + 1) many steps. Together with Lemma 1 we thus get 
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Corollary 2. If a PAR-output DPG Q = (V,E,I,0) can he executed at all it 
can he executed in time 0{\V\'^). 

Below we sketch the idea how to prove Theorem 1. Assume that G is an exe- 
cutable PAR-output DPG with nodes V = {t;i, . . . , Vn}- For a run Hg of Q define 
the characteristic vector as the sequence (|lP(ui)|, \W{v2)\, ■ ■ ■ , |kF(w„)|) and 
l{G) '■= [log2 max„gy |lP(u)|J . Note that on any path rijj , . . . ,Vi^ from a source 
to a sink the sequence of multiplicities must be increasing. Furthermore, 

we have proven that for PAR-output DPGs G all runs have exactly the same cha- 
racteristic vector [3], thus ^{G) is independent of a particular run and hence well 
defined. We will use five steps to transform G into a graph G 



G ^ Qa ^ Gb 



(C) 
— s> 



Gc 



(D) 
!■ 



Gd 



(E) 
!■ 



Ge 



G 



in such a way that the intermediate DPGs fulfill the following increasing list of 
properties (that means Ga fulfills (A), then Gb fulfills (A) and (B), etc): 

(A) every node with input mode ALT has indegree exactly two; 

{B) every node with input mode PAR has only direct predecessors and successors 
with input mode ALT ; 

(C) the sources and sinks have input mode PAR: on every path from a source to 
a sink the input mode alternates between PAR and ALT; 

(D) for every value k < k := max„gg |lP(v)| there exists a unique PAR-input 
node Mfc with \W{uk) \ = k] 

(E) for all triples (i,j,k) with i + j = k < k considering the unique nodes 
Ui, Uj,Uk according to (D), there exists a unique node Vij^k in Ge with input 
mode ALT such that {u^,Vij^k), (uj,Vij^k), (v^j^k^Uk) are edges in Ge- 

Property (D) and the monotonicity of the multiplicities |kF(ufe)| imply that 
the sink is unique and achieves maximal multiplicity k. 

These transformations expand as well as shrink subgraphs of G in order to 
achieve this unique layered structure. Although the size of G can be exponential 
in the size of t/, we can guarantee that the 7- values do not change, in particular 
'jiG) = j{G)- Furthermore, the property that 7(t/)-overlapping runs exist is 
preserved as well. We call a DPG fulfilling property (A) to {E) a complete 
PAR-output DPG Ck, where k denotes the multiplicity of the sink of Ck- 

Lemma 5. Every complete PAR-output DPG Ck has a j{Ck)-overlapping run. 

One might be temptered to guess that a run with minimal overlap serves as 
the basis for a schedule of minimal length. But this is not correct. Overlapping 
may be crucial to design optimal executions of DPGs as it is shown by the 
following result. A proof can be found in the full paper, where we construct a 
family of DPGs exhibiting this behaviour. 

Theorem 2. There exist PAR-output DPGs G with an appropriate communi- 
cation delay S such that no non- overlapping run of G can achieve the minimal 
schedule length To-pt{G,S). 
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4 The Complexity of Scheduling 

The previous section has shown a linear bound on the maximal execution time 
for PAR-output DPGs. On the other hand, execution times of unrestricted DPGs 
may be exponential. In this section we prove that the computational complexity 
to determine the minimal execution time for PAR-output DPGs is still very diffi- 
cult, namely the scheduling problem for such DPGs is J\f SXVTXJ^S-coTa\)\ete 
as in the general case. 

The AfiPTPTTAdiP-completeness of the scheduling problem in the unrestric- 
ted case has been proved in [3]. Furthermore, we have shown that for any variant 
with input or output mode restrictions, except the case of PAR-output DPGs, 
the computational complexity of the scheduling problem decreases drastically, 
namely it can be solved in J\fV, in some cases even with less effort. These upper 
bounds have been matched with corresponding lower bounds. 

For scheduling PAR-output DPGs we were able to prove ,B'H 2 -hardness. This 
means that the scheduling problem is probably more difficult for PAR-output 
DPGs than for the other restrictions. In this section we improve the lower bo- 
und showing that the problem is J\f SXVTXXiS-haid. Since an optimal schedule 
of PAR-output DPGs can always be found in AfSXVTXXiS as in the unre- 
stricted case the lower bound is best possible. The scheduling problem is thus 
M£XVTTM£-com]Aeie. 

Theorem 3. Scheduling PAR-output DPGs with constant communication delay 
is M£XVTTM£-hard. 

The hardness result is obtained by showing that PAR-output DPGs can represent 
certain Boolean formulas in a highly succinct way. A sketch of this construction 
will be given in the rest of this section. For the reduction we will use the following 
M£XVTTM£-com]Aete problem [5]: 

Definition 5 (SUCCINCT-3SAT). As input we are given a Boolean circuit 
over the standard AND, OR, NOT-basis that succinctly codes a Boolean formula 
in conjunctive normal form with the additional property that each clause has 
exactly three literals and each literal appears exactly three times. Suppose that 
the encoded formula consists ofn variables and m clauses. On input (0, i, k) with 
i G {0,... ,n— 1} and k G {1,2,3} (appropriately coded in binary), the coding 
circuit returns the index of the clause where the literal -'Xi appears the k-th time. 
On input (1, i, k) it returns the index of the clause where Xi appears for the k-th 
time. On input (2,j,k) with j G (0,... ,m — 1} and k G {1,2,3}, it returns 
the k-th literal of the j-th clause. The problem is to decide whether the encoded 
formula is satisfiable. 

4.1 Boolean Circuits and Dynamic Process Graphs 

In [3] the first step of the Af£ X'PTTAAS-haxdness proof for the general case made 
a transformation of a given Boolean circuit B into a DPG Qb such that runs of 
Qb simulate computations of B on arbitrary input assignments. To encode an 
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arbitrary Boolean circuit i? by a DPG Gb code each gate of B separately 
by its own subgraph and draw edges from the output nodes of each gate to the 
appropriate input nodes (for details of this construction see [3]). The simulation 
of a gate-subgraph is based on testing the different possible input constellations. 
For this simulation the ALT-output mode is essential and it cannot be simulated 
using only the PAR-output mode. 

In this subsection we describe a different technique to transform a Boolean 
circuit into a DPG that requires only nodes with output mode PAR. Let B have 
n input variables. To construct a PAR-output DPG Qb we proceed in two steps: 

1. Gonstruct an equivalent circuit B' that contains only AND and OR gates 
(of fanin two), NOT gates are superffuous since B' receives as input also the 
negation of each variable. B' computes the same function as B and has the 
additional properties: 

a) the lengths of all paths from an input to an output are equal, 

b) on every input output path the AND and OR gates alternate. 

2. From B' generate Qb as follows: Let G = (V, E) be the graph of B' , then 
Gb ■= (y,E,I,0) with O = PAR. For u € V we choose I{u) := ALT iff the 
corresponding gate of u in B' is an OR -gate, else I{u) := PAR. 




Fig. 3. A Boolean circuit B, its DPG Gb,ci, and a run for an input xi = X 4 — 1 and 
X2 = X3 = Xs = 0. 



One can show that Gb executable and that for any communication delay 
S it holds: Topt{GB,S) < \B'\. 

To understand the functionality of Gb we will now modify Gbj such that the 
run of the resulting DPG simulates a computation of B on an arbitrary input 
vector a G {0, 1}”. 

Define the line DPG £ of length fc > 1 as a DPG consisting of k vertices 
ui,U 2 ,... ,Vk and k — \ edges: (vi,V 2 ), ■ ■ ■ , (vk-i,Vk)- The input and output 
modes are inessential. For completeness we set I{vi) = 0{vi) = PAR. Now let £ 
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denote the number of nodes of Gb and let k := 2t. We add n lines Ci,. . . , of 
length k to Gb such that Ci corresponds to the variable Xi as follows: li Xi = 1 
then we connect the sink node Vk,i of Li to the input node of Gb that corresponds 
to the literal Xi in If = 0 we connect Vk,i to the input node of Gb that 
corresponds to the literal Xi in B' . The resulting DPG will be denoted by GB,a- 
An example for such a DPG Gb,ci is shown in Fig. 3(b). 

Lemma 6. Let j/i, . . . ,ym be the output nodes of circuit B, let a he an input 
assignmet, and let /3 G {0, 1}™ he the output vector of B for input a. Then for 
every run H of Gb,oi and for any i € the following holds: 

f3{i) = 0 Vw G W{yi) : the subrun induced by w has size > 2£. 

Moreover there exists such a run for Gb,ol that for any sequence < i 2 < 

. . . < ik, with (3{ij) = 1, one can chose tasks Wi. G W{yi.) in Ha such that the 
subgraph induced by the tasks Wi^ and their predecessors has size bounded by i, 
he- \[ji<j<kT^{w,^)\ < 

Hence to test some output gates yi^, . . . , yi^ of B for a particular input assign- 
ment a one can check for the run Ha whether for some tasks of , . . . , the 
corresponding subgraph has size at most 

4.2 The Reduction 

Let B be an input of the SUGGINGT-3SAT problem, that means a circuit coding 
a Boolean 3GNF formula T . We may assume that T has exactly m = 2'^ clauses. 
Then there will be n = 2'^~^ variables. 

The first step of our reduction is to construct two circuits Ai and A 2 . The 
first circuit A\ verifies the syntax of the formula encoded by B. Speaking more 
formally, Ai with the input j returns 1 iff the given circuit B returns j on 
exactly three inputs (fc, k'), k") for all k G {0, 1} and fc', k” G {1, 2, 3}. 

Otherwise Ai returns 0. 

The second circuit A 2 will be used to check whether the encoded formula is 
satisfiable. A 2 with input (fc, j, i), is defined as follows. A 2 {k, j, i) returns (1, /3) 
if xf is the /c-th literal of the j-th clause, where denotes ->Xi and x\ means 
Xi- In any other case, A 2 returns (0, 0). Obviously, these circuits can easily be 
constructed for a given B. Let us denote the DPG that corresponds to circuit 
Ai by Ai- We will call A\ the syntax- verifier, and .42 the index- verifier 
graph. Because of lack of space only the syntax-verifier will be described in the 
following. 

Let us call the sources of Ai jti Jfi, jt 2 , jf 2 , ■ • • , jtdJfd- They encode the 
integer j as follows: If the Gth digit of the binary representation of j is I then in 
a run for A\ that represents j the node jt^ has an execution instance that can 
be computed within a certain time, while the execution instance of requires 
slightly more time. If the Gth digit of j is 0 then jfc can be computed fast and 
jte requires the larger amount of time. The unique instance of either jt^ or jfi 
that can be computed fast will be denoted by t{£). 
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Our first goal is to construct a DPG that represents the input values of j. 
More precisely, the run of this DPG represents all possible 2"^ values for j. The 
crucial point of the whole construction is then to couple this value genarator 
DPG with Ai in an appropriate way. 

To generate the binary representation of j we will use a DPG called a coun- 
ting graph T~Ck,ei,e 2 with parameters k,ii and £ 2 - Their main building blocks 
are subgraphs Ce := (V,E,I,0) with V = {u^, Uo,i) 1 < i < ^} U {uo}> 
E := Ui)\l < i < I, j G {0, 1}}, O = PAR, and I = ALT. Note 

that for each Ce if considered separately the sink process Ui has 2^ execution 
nodes. 

Using these subgraphs one can achieve the following property which will be 
important for the reduction. ^as 2k sink nodes, which will be denoted 

by t/J, yl, . ■ ■ , yl- For every run of TCk,ei,e 2 it holds that each yf has exactly 
2* execution tasks and for exactly 2^“^ of these tasks each corresponding subrun 
has size St := k{£i + 2) + + 4. For the remaining 2^“^ tasks each subrun has 

size Sf := St + {k — 1) ■ {£2 + 1) + 2. This means that for the suitable values 
£\ and £ 2 , exactly 2*“^ subruns have small size whereas the remaining subruns 
are much bigger. Furthermore, any task of a node yf is associated with a 
task of a node yl~^ such that |7^(tf)| = St iff \TZ(t]~^)\ = Sf. Finally, the 
construction guarantees that any processor can only compute one task for any 
pair 

Drawing edges (yf, j/i), (yf , Jii) from the sinks of the counting graph to the 
sources of Ai we obtain a DPG which has similar properties as the DPG Gb.o 
discussed in the previuous subsection. Note that the pair yf,yf of the counting 
graph specifies the value of t{i). Using Lemma 6 one can prove that for any 
input j the circuit Ai returns the value 1 iff for any value j (represented by the 
particular tasks there exists a subrun for the sink of Ai of moderate size, 

i.e. < k{£i + 2) + fc(fc^ + 4) + |Mi|. 

The DPG for checking whether the encoded formula is satisfiable is tech- 
nically more involved. This graph has to check whether the input values i, j, 
and k fit together (which is something like a local test) . Furthermore, it requires 
a mechanism to choose the value of the variables Xi and decide if these values 
satisfy each clause Cj (which is something like a global test) . The details can be 
found in the full paper. 

5 Conclusion 

We have shown that the computational complexity to determine the optimal 
schedule length T^pt for PAR-output DPGs is quite high, this problem is complete 
for AfSXVEXAiS. On the other hand, we have shown that the maximal size of 
a subrun of a minimal-overlapping run El gives a good approximation for Topt- 
However, this bound is not tight and an interesting problem is to give some other 
natural structural properties of runs which allow better approximations. 

Another interesting question is how efficiently a minimal-overlapping run can 
be constructed for a given PAR-output DPG Q. In this paper we have shown that 
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all such runs have the same size. Can we find a minimal-overlapping run H in 
linear time with respect to the size of HI Note that Lemma 1 and Lemma 2 in 
[3] imply that for certain DPGs the number of different runs can be exponential 
with respect to the size of H . Therefore, a brute-force searching will be inefficient. 
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Abstract. The bandwidth minimization problem on graphs of some 
special graph classes is studied and the following results are obtained. 

The problem remains NP-complete when restricted to splitgraphs. There 
is a linear time algorithm to compute the exact bandwidth of a subclass 
of splitgraphs called hedgehogs. There is an efficient algorithm to appro- 
ximate the bandwidth of circular permutation graphs within a factor of 
four. 

1 Introduction 

The BANDWIDTH problem for a graph is that of labeling its vertices with di- 
stinct integers so that the maximum difference across an edge is minimized. One 
of the original motives in graph algorithms was finding the most efficient way 
to represent the adjacency matrix of a graph. The problem is namely equivalent 
with finding a permutation of the vertices such that the non-zeros lie in a small 
band around the diagonal. For matrix problems in general this problem is of 
interest since it can clearly be used to accelerate matrix operations like multipli- 
cations and inversions. The problem was formulated for graphs in the beginning 
of the sixties. Heuristics for the bandwidth problem for graphs exist already for 
a long time, however their worst-case performance cannot be guaranteed (for an 
overview and further references see [3,4]). 

Nowadays, the bandwidth problem is standing in the midst of all kinds 
of new research areas because it has become a sort of test-case in the area of 
complexity analysis, approximation algorithms, and research into special classes 
of graphs. This is apparent by the long list of publications on the subject. 

The problem is NP-complete [18] and it remains NP-complete even for trees 
[7,16]. Concerning fixed parameter complexity, the problem is LF[t]-hard for 
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every t (see, e.g. [5]). This is seen as strong evidence that to test if a graph 
has bandwidth at most k is not fixed parameter tractable. On the positive side, 
there is an algorithm to test if a graph has bandwidth at most k with running 
time 0{n^) [8]. 

The BANDWIDTH minimization problem is also in the center of renewed inte- 
rest in the area of approximation algorithms. We mention some recent results. 
There is no polynomial time algorithm to approximate the bandwidth of a graph 
within a constant factor, unless P=NP [22]. An approximation algorithm for the 
bandwidth problem has been obtained using semi-definite programming [2] . Fur- 
thermore a polylogarithmic approximation algorithm has been given [6]. Much 
research has already been done on the approximation of the bandwidth for gra- 
phs in special classes of graphs. We mention a few of them. It has been shown 
that there is no polynomial time constant factor approximation algorithm for 
trees unless P=NP [22]. Approximation algorithms for special classes of trees 
have been given in [9,10]. For AT-free graphs a factor two approximation was 
obtained [13]. For split graphs a factor two approximation is known [23]. It 
has been shown that there is no polynomial time approximation algorithm for 
circular arc graphs with a factor less than two unless P=NP [22]. On the other 
hand, an efficient algorithm to approximate the bandwidth of circular arc graphs 
within a factor two is known [15]. 

Unfortunately only a few typically small graph classes are known to have 
a polynomial time exact bandwidth algorithm, among them caterpillars with 
hairs of length at most two [1], chain graphs [14], interval graphs [11] and theta 
graphs [20]. 

In this paper we consider the complexity of the bandwidth problem for 
different graph classes. We show that the problem remains NP-complete when 
restricted to splitgraphs. For a subclass of splitgraphs, called hedgehogs, we 
were able to find a linear time exact algorithm. As an easy consequence, we 
obtain a linear time 2-approximation algorithm for chordal graphs of diameter 
at most three (including all split graphs). Finally we use the above-mentioned 
approximation algorithm for circular arc graphs to obtain the first constant 
factor approximation algorithm for circular permutation graphs. 



2 The NP-Completeness for Splitgraphs 

In this section we show that the bandwidth problem remains NP-complete 
when restricted to splitgraphs. 

Definition 1. A layout L of G is a 1-1 mapping from V onto {1,...,|U|}. The 
width b{G,L) of L is zero, if G has no edges, and otherwise 

b{G, L) = max{|L(u) — L{v)\ : {u, u} G E} . 

The bandwidth of G is 



bw(G) = min{6(G, L) : L is a layout of G} . 
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We call a layout L a k-layout if its width equals k. We say that a layout L 
of G is optimal if b{G,L) = bw(G). 

The following lower bound for the bandwidth is easy to obtain, see e.g. [3]. 
Lemma 1. For every graph G = (V, E) we have 

bw(G) > max — , 

wcv diam(G[W]) 



especially 



bw(G) > max 



1^1 -1 ^(g) 

diam(G) ’ 2 



w(G) - 1 



Proof. Let L be an optimal layout of G and WCV. We consider the vertices 
w,z € W at leftmost and rightmost position in L among all vertices in W. By 
the pigeonhole principle there is an edge {x, y} on every path from w to 2 : in G 
such that \L{x) — L{y)\ > {\W\ — l)/(diam(G[W])), where diam(G) denotes the 
diameter of G. 

Especially we choose W to be the whole vertex set V, the neighbourhood 
N(v) of a vertex v of maximum degree augmented by v itself {\W\ = A{G) + 1) 
and a maximum clique of G {\W\ = w(G)). □ 



Definition 2. A splitgraph G = (G, I, E) is a graph with vertex set C U I and 
edge set E such that G is a clique and I an independent set in G. 

The complexity of the bandwidth problem for splitgraphs was open for a 
long time (see, e.g., [15,23]). To prove its NP-completeness we use a reduction 
from the bandwidth problem on cobipartite graphs, which is known to be NP- 
complete (see [13]). Recall that cobipartite graphs are just the complements of 
bipartite graphs. 

Lemma 2. For every cobipartite graph G = {X, Y, E) there is an optimal layout 
L such that L{x) < L{y) for every x G X and y GY . 

Proof. By Lemma 1 we know |X|, |Y| < bw(G) + l because X and Y form cliques 
of G. We add vertices to X and Y without neighbours in the other color class, 
such that the new sets X' and Y' satisfy \X'\ = |Y'| = bw(G) + 1 and call this 
new graph G'. Then bw(G) = tw(G) = tw(G') = bw(G'), because for cobipartite 
graphs bandwidth and treewidth are equal [19] and every tree-decomposition of 
G can easily extended to a tree-decomposition of G'. 

The new graph G' is cobipartite. In every optimal layout of G' the vertices 
in X' appear consecutively because |X'| = bw(G') -I- 1 as well as the vertices in 
Y' do. Hence we obtain a layout L of G with L{x) < L{y) for every x G X and 
y G Y if we restrict an optimal layout of G' with the analogous property. □ 



Theorem 1. The bandwidth problem is NP-complete for splitgraphs. 
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Proof. Consider a cobipartite graph G = {X, Y, E) with color classes X and 
Y and integer k. By Lemma 1 we may assume k > |X| — 1 and |X| > \Y\. 
First choose sets A and B of additional vertices such that \A\ = \X\ and \B\ = 
k + 1 — |X|. Starting from G we form a splitgraph H with clique B iJ X and 
independent set AUY. Therefore we add all edges {a, b} with a & A and b G B 
and remove the edges from AUY. The edges of G between X and Y remain 
unchanged. 

Let L be a fc-layout of G such that L{x) < L{y) for all x G X and y G Y. 
Then a fc-layout of H starts with vertices of A, followed by the vertices of B and 
then followed by the layout L of G. 

Next consider a fc-layout of H. By the cardinalities of A and B all vertices 
in the clique BU X are consecutive as well as all vertices of ^ U B. Without loss 
of generality, the layout is of the form A, B, X, Y. The restriction to X and Y 
is a fc-layout for G because fc > |X| > |F|, i.e., the edges inside Y missing in H 
have at most the length of the longest edge inside X. □ 

3 Bandwidth of Hedgehogs 

In this section we consider a subclass of split graphs. We present a formula for 
the bandwidth of hedgehogs and describe corresponding optimal layouts. 

Definition 3. A splitgraph G = (G, I, E) is said to be a hedgehog if every 
vertex of I has degree one. 

Let G = (G, /, E) be a hedgehog. An edge e = {c, x} of G is called a hair if 
c G G and x G I. The root (marked by o in forthcoming figures) of the hair e 
is c and the tip (marked by •) of e is x. Clearly every vertex in / is a tip, but 
not all vertices in G need to be roots. A vertex p G C that is not incident with 
a hair is called pore (□). For every vertex c G C we define M(c) = / fl iV(c), 
M[c] = {c} U M(c). Throughout this section let n = |G U /|. 

An hedgehog is called trivial if it has at most one root. Since every hedgehog 
is connected, trivial hedgehogs have diameter at most 2 and nontrivial hedgehogs 
have diameter 3. It is not hard to see that bw(G) = max {|G| — I, |"|Z\(G)] } for 
trivial hedgehogs. (Notice that A(G) = n — 1 for trivial hedgehogs.) 

Definition 4. A layout L : GUI — >■ {1, 2, . . . , n} of a hedgehog G = (G, /, E) 
is normal if L{a) < L{c) 4=^ L{x) < L{z) for every pair of hairs {a,x}, {c,z} 
with a ^ c and x,z G I. 



Lemma 3. For every hedgehog there is an optimal normal layout. 

Proof. Let L be an optimal layout of a hedgehog G = (G, I, E) that is not 
normal. Then there is a pair of hairs {a, x} and {c, z}, x, z G I, with L{a) < L{c) 
and L(x) > L{z). Changing positions of x and 2 we do not increase the width 
of the layout, but we decrease the number of such pairs of hairs. □ 
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Let L be a normal layout of G = (G, I, E). For each pair of integers s, t with 
1 < s < t < n, we call {v : s < L(v) < t} an interval of L. Then for every 
root c G C, the set M(c) either forms an interval of L or the smallest interval 
containing M(c) consists of M(c) and some vertices of G, but not other vertices 
of/. 

Now let L be a fixed normal layout of a hedgehog G = (G, I, E). We partition 
the tips into left tips (◄), these are vertices x G I with L{x) < min{L(c) : 
c G G}, middle tips (a), these are vertices x G I with min{L(c) : c G C} < 
L{x) < max{L(c) : c G C} and right tips (►), these are vertices x G I with 
max{L(c) : c G C} < L{x). Similarly we define left roots (<), these are roots 
adjacent to left tips only, middle roots (a), these are roots adjacent to middle 
tips only and right roots (t>), these are roots adjacent to right tips only. A vertex 
c G C that is not a left, middle or right root is either a pore (if M(c) = 0) or 
a parting. A left parting (<) is adjacent to left and middle tips, but not to right 
tips. A right parting (>) is adjacent to right and middle tips, but not to left tips. 
Finally, a centre parting (o) is adjacent to at least one left tip and one right tip. 
(See e.g. Fig. 1.) 

Lemma 4. Let L he a normal layout of a hedgehog G = (G, /, E) . Then there 
is at most one left parting, at most one centre parting and at most one right 
parting. If either a left or a right parting exists then there is no centre parting. 
If there is a centre parting then there is neither a left nor a right parting. 

Proof. Otherwise we have hairs {a, x} and {c, z} incident to partings a and c such 
that L{a) < L{c) < L{z) < L{x) or L{z) < L{x) < L{a) < L{c) contradicting 
the fact that L is normal. □ 



Definition 5. A normal layout L : G U / — >■ {1, 2, . . . , n} of a hedgehog G = 
(G, /, E) is said to be a sorted layout if the vertices appear in the following order: 
in the leftmost positions (from the left) first all left tips, then all left roots, then 
a left parting (if any); in the rightmost positions (from the right) first all right 
tips, then all right roots, then a right parting (if any). 



Remark 1. It is not hard to verify that every hedgehog has an optimal sorted 
layout (that we shall obtain as an immediate consequence of the construction of 
the optimal layouts in the proof of Theorem 2). 



Lemma 5. Let L he a sorted layout of a hedgehog G = (G,I,E). Let £ be the 
number of left tips in L and let r be the number of right tips in L. If L has no 
centre parting then b{G, L) = max{/, r,n — £ — r — 1}. Otherwise let Cm G G he 
a centre parting and let d he the maximum of L{cm) — min{L(a;) : x G 
and max{L(a;) : x G — L{cm). Then b{G, L) = max{£, r, n — / — r — 1, d}. 

Furthermore L{cm) — min{L(x) : x G M{c)} < £ if the centre parting is next 
to the rightmost left root and max{L(x) : x G M{cm)} ^ r if the centre parting 
is next to the leftmost right root in L. 
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Proof. Clearly an edge violating any statement of the lemma must be a hair. 

First we consider a vertex c G C which is either a left root, a left parting or 
a centre parting such that no pore is left of c in L. Let {c, t} be any hair such 
that t is a left tip. Let {a, z} be the hair with L{z) = 1 and L{a) = £+1. Since 
each vertex v oi C with L{v) G L(c)} is adjacent to at least one left 

tip we may conclude L{c) — L{t) < 1. 

Analogously we obtain L(t') — L(c') < r for each hair {c',t'} with t' right 
tip. Together this implies all statements of the lemma. □ 

Now we are ready to present the formula for the bandwidth of hedgehogs. 

Theorem 2. Let G = {C,I,E) be a hedgehog. Then 

bw(G) = max{|G| - 1, \\A{G)\ , [i(|C| + |/| - 1)] } . 

Proof. The formula holds for trivial hedgehogs, thus we may assume that G is 
a nontrivial hedgehog. Hence G has diameter 3. 

The lower bound follows from Lemma 1. 

To establish the upper bound we shall distinguish three cases depending on 
which of the three values in the formula is maximum. Nevertheless there is a 
general way of obtaining an optimal sorted layout that we shall describe first. 
(See Fig. 1 for an illustration.) 

To obtain a layout of I, place the sets M(c) for all roots c G G as interval 
Ic side by side where the order of the intervals is arbitrary. We shall cut the 
obtained I-layout (between two consecutive tips) at two positions that we call 
left eut and right cut (and that may coincide). 

Then all tips left of the left cut will be the left tips and all tips right of the 
right cut will be the right tips of the final optimal layout. All other tips will be 
middle tips. Middle tips and all vertices of G will be placed between the left cut 
and the right cut. Notice that there is a centre parting Cm in L iff left and right 
cut both divide the same interval Ic^. 

Since we construct a sorted layout L, all left roots, left partings, right partings 
and right roots and their position in L is fixed by the /-layout and the two 
cuts. Notice that the positions of middle roots, middle tips and pores are of no 
importance for the width of L by Lemma 5. It remains to fix the position of the 
centre parting in L if there is one. The feasible positions for Cm are right of all 
left tips and all left roots and left of all right roots and all right tips. We shall 
show that we can always place the centre parting in a feasible position of L such 
that L is optimal. 

Now we distinguish three cases. For each case we show how to choose the 
left and the right cut and that the constructed layout L has the requested width 
and is optimal. 

Case A: [iZ\(G)] > max {|G| — 1, |"|(n — 1)] } 

We choose the left and the right cut of the /-layout such that the final layout 
L has I = [^Z\(G)] left tips and r = |"iZ\(G)] right tips if 2 [iZ\(G)] < |/| (i.e. 
there are enough tips). Otherwise we choose t= [ 5 I/I] and r = [^l/lj- Thus in 
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both cases [^Z\(G)] > n — £ — r — 1 either since [^Z\(G)] > 1 )] or since 

|"|Z\(G)] > |G| — 1. Thus, if L has no centre parting then b{G,L) < |"iz\(G)] 
by Lemma 5. Consequently L is optimal. 



clique 

independent set 



layout 



o o □ o 



lA(G)/2] p|-2.rA(G)/21 

r 



\A(G)/2-] 












1 n > > ► ► ► ► ► ► ► ► 



Fig. 1. Case A 



Suppose Cm is the centre parting of L, i.e. the left and the right cut divide 
the interval Ic^. Placing Cm in the leftmost feasible position the distance to 
its leftmost neighbor in L is at most £ < |"|Z\(G)] either since L is sorted or, 
if there is no left root, since L{cm) = £ + 1- Analogously, placing Cm in the 
rightmost feasible position the distance to its rightmost neighbor in L is at most 
r < [^Z\(G)] either since L{cm) = n — r (i.e. no right root) or since the upper 
bound r of the distance is guaranteed since L is a sorted layout. 

Consequently there is a feasible position of Cm such that the distance to its 
leftmost neighbor and the distance to its rightmost neighbor in L are both at 
most [i/\(G)]. Placing c„ in such a position we obtain a layout L of width at 
most |"iZ\(G)]. Consequently L is optimal. 

Case B: |G| — 1 > max { [iZ\(G)] , [|(n — 1)] } 

We choose the left and the right cut of the /-layout such that the final layout 
L has £ = [ 5 I/I] left tips and r = [ 5 I/IJ right tips. Clearly n — £ — r—l = |G| — 1. 
Notice that |G| — 1 > [^(IGI -I- |/| — 1)] implies [ 5 I/I] < |G| — 1. Consequently 
if L has no centre parting then b{G,L) < |G| — 1 by Lemma 5 and thus L is 
optimal. 



clique 

independent set 



layout 



I o o o □ o 




rid/21 
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Fig. 2. Case B 



Suppose Cm is the centre parting of L. Analogously to Case A, placing Cm 
in the leftmost feasible position the distance to its leftmost neighbor in L is at 
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most £ < IC'I — 1 since L(cm) = ^ + 1 or since L is sorted. Similarly, placing Cm 
in the rightmost feasible position the distance to its rightmost neighbor in L is 
at most r < |C| — 1. Consequently there is a feasible position of such that its 
distance to its leftmost neighbor and its distance to its rightmost neighbor are 
both at most [^Z\(G)] < \C\ — 1. We place Cm in such a position and obtain an 
optimal layout L. 

Case C: — 1)] > max {|C| — 1, |"^Z\(G)] } 

We choose the left and the right cut of the /-layout such that the final layout 
L has £= |"|(n — 1)] left tips and r = |"|(n — 1)] right tips. If there should not 
be enough tips we choose either one, say £, or both, £ and r, equal to — 1)J . 
Consequently n — £ — r— 1< [|(n — 1)]. 



clique 

independent set 
layout 



o o n o 



r(n-l)/31 



r(n-l)/31 



^rfX 
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-< -< <1 AO A n > ► ► ► ► ► 



Fig. 3. Case C 



Thus, if L has no centre parting then 6(G, L) < — 1)] by Lemma 5 and 

L is optimal. If there is a centre parting then analogously to the cases A and 
B we obtain that there is a feasible position of Cm such that the distance to its 
leftmost neighbor is at most £ and the distance to its rightmost neighbor is at 
most r. We place Cm in such a position and obtain an optimal layout L. □ 

Corollary 1. There is a linear time algorithm for hedgehogs that computes the 
bandwidth and an optimal layout. 

4 Consequences for Bandwidth Approximation 

The following tool for the design of bandwidth approximation algorithms for 
special graph classes has been given in [13]. 

Lemma 6. Let G and H he graphs with G C H C G‘^ or H C G C i/'^ for an 
integer d > 1, and let L he an optimal layout for H, i.e., b{H,L) = hw{H). Then 
L approximates the bandwidth of G by a factor of d, i.e., b{G,L) < (i-bw(G). 

Proof. First we assume G C H C G^^ . Then 

b{G, L) < b{H, L), since G C H 

= bw(i/), since L is optimal for H 

< hw{G<^), since // C G”^ 

< d ■ bw(G) . 
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Now lei H <ZGCH<^. Then 



b{G,L) < b{H‘^,L), 
<d-b{H, L), 
= d ■ hw{H), 
< d ■ bw(G), 



since G C H‘^ 

since L is optimal for H 
since H C G. 



This proves the lemma. □ 

In a general fashion its consequences might be formulated as follows. 

Lemma 7. Let Q and H be graph classes and d > 1 an integer. Assume there 
is an s{n) algorithm to approximate the bandwidth of graphs in class H within 
a factor of c. 

Suppose there is either a t{n) algorithm to compute for a given graph G € G 
a graph H G TL such that G C H C G‘^, or there is a t{n) algorithm to compute 
for a given graph G G G a graph H G TL such that H C G C . 

Then there is an s{n)+t{n) algorithm to approximate the bandwidth of graphs 
in class G within a factor of c- d. 

Let G = {G, I, E) be any splitgraph. A hedgehog H = {G, I, F) with the same 
clique G and same independent set I spans G if F Q E. A spanning hedgehog of 
G is easy to find. Obviously we have H C G C for every spanning hedgehog 
iJ of G. By Lemma 6 and Lemma 7 respectively, combined with Corollary 1 we 
obtain 

Corollary 2. There is a linear time algorithm to approximate the bandwidth of 
splitgraphs with a factor of 2. 

We mention that a simple and different linear time 2-approximation algo- 
rithm for split graphs was obtained in [23]. 

Observing that any graph with a dominating clique, i.e. a clique G such 
that every vertex outside G has a neighbor in G, has spanning hedgehog, we 
can slightly extend our 2-approximation algorithm to chordal graphs having a 
dominating clique. Using a theorem of [12] stating that a chordal graph has a 
dominating clique iff it has diameter at most three, we obtain 

Corollary 3. There is a linear time algorithm to approximate the bandwidth of 
chordal graphs of diameter at most three within a factor of 2. 

5 Circular Permutation Graphs 

In this section we use the technique of Lemmas 6 and 7 to obtain an efficient 
algorithm to approximate the bandwidth of circular permutation graphs within 
a factor of four. 

A family M = {oy : v G V} oi subsets of a universal set U is an intersection 
model of the graph G = (V,E) if E = {{x,y} : Ox fl Oy ^ 0}. A graph G is 
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an intersection graph if it has an intersection model. In this section we consider 
classes of intersection graphs defined by special intersection models. 

Interval graphs are the intersection graphs of intervals on the real line. The 
exact bandwidth of an interval graph is computable in polynomial time [11, 
21]. Circular arc graphs are the intersection graphs of circular arcs on a circle. 
Obviously, every interval graph is a circular arc graph. The following result was 
proved in [15]. 

Theorem 3. The bandwidth of a circular arc graph can he approximated to 
within a factor of four in 0{n) time, and to within a factor of two in 0{nlog^ n) 
time. 

Permutation graphs are the intersection graphs of straight line segments bet- 
ween two parallel lines. Circular permutation graphs generalize permutation gra- 
phs in the same way as circular arc graphs generalize interval graphs. 

We consider a finite family of spiral segments in the annual region between 
two concentric circles in the plane such that any two segments have at most one 
common point. The graphs defined by such intersection models form the class 
of circular permutation graphs [17]. 

In the following we consider a circular permutation graph G = (V, E) that 
is not a permutation graph. For a fixed intersection model {s„ : v G V} of G we 
construct a circular arc graph H = (V, F) such that G C H C G^. 





Fig. 4. The spiral segments of a chord- 
less dominating cycle C 12 in a circular 
permutation diagram. 



Fig. 5. The folded circular arc model 
is indicated by the dashed line in the 
circular permutation diagram. 



Therefore we fix a chordless dominating cycle {v\,ui,V 2 ,U 2 , ■ ■ ■ ,vi,ui) of G. 
The segments define a partition of the annular region into I sections. The 
border of such an section consists of the segment Sy., the circular arc li on the 
inner circle, the segment and the circular arc Oi on the outer circle. Note 
that either for all z = 1, . . . , /, the endpoints of the segment belong to li+i 




Bandwidth of Split and Circular Permutation Graphs 253 



and Oi-i, or for alH = 1, ... ,1, the endpoints of the segment belong to li-i 
and Oi+i- Without loss of generality we assume henceforth the first case. 

We form a circle C from the arcs I\, 0\, O^, . ■ . h and Oi in this circular 

order. Let liOi and Oili+i denote the common points on C of the arcs li and 
Oi and of the arcs Oi and h+i, respectively. For every vertex v & V 'we define 
a circular arc ay on C as follows: The endpoints of ay are the endpoints of the 
segment .Sy. If v = Vi for an index i, then the arc ay contains exactly one point, 
namely Oi-iU. If w G N{vi) then ay contains Oi-ili and for v € N[ui] the arc 
ay contains the point liOi. 

Let H = (V,F) be the circular arc graph defined by the intersection model 
{a„ : u G V}. 

Lemma 8. G C H C . 

Proof. First we consider an edge {x, y} G E. Let i be the index such that the 
common point of Sx and Sy belongs to the section between Sy^ and Sy^J^^ (inclu- 
ding If a: G A^[ui] and y G then {Oi-ili} G Ua, fl ay. If x ^ and 

y ^ N[vi] then {liOi} G axtlUy. Hence we may assume x G and y ^ iV[wj]. 
This implies that ax and ay have a point of li in common. Hence, in all cases 
we have {x, y} G F. 

Now let {x, y} G F. Both ax and ay contain a point liOi or OiL+i for a 
suitable index i. For liOi & ax ay we have Ui G N[x] fl N[y]. Oili+i G Oa, fl Oy 
implies Ui G iV[a;] fl iV[ 2 /]. liOi G ax \ ay and Oili+i G ay \ ax is impossible 
because {x, y} G F. Consequently, x and y are adjacent in G or have a common 
neighbour. □ 

Theorem 4. The bandwidth of a circular permutation graph can he approxima- 
ted to within a factor of four in 0{nlog^ n) time. 

Proof. By Lemmas 6, 7 and 8 and Theorem 3. □ 
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Abstract. We consider the problem of recognizing AT-free graphs. Alt- 
hough there is a simple 0(n®) algorithm, no faster method for solving 
this problem had been known. Here we give three different algorithms 
which have a better time complexity for graphs which are sparse or have 
a sparse complement; in particular we give algorithms which recognize 
AT-free graphs in 0{nfn + n^), -I- n^), and + nm). In 

addition we give a new characterization of graphs with bounded astero- 
idal number by the help of the knotting graph, a combinatorial structure 
which was introduced by Gallai for considering comparability graphs. 



1 Introduction 

An asteroidal triple or, briefly, an AT of a given graph G is a set of three 
independent vertices such that there is path between each pair of these vertices 
that does not contain any vertex of the neighborhood of the third. Consequently, 
a graph G is called asteroidal triple-free or AT-free if there is no asteroidal triple 
in G and it is called coAT-free if G is AT-free. 

Almost forty years ago, Lekkerkerker and Boland [13] defined the concept of 
an asteroidal triple for the first time. They used it for the investigation of intersec- 
tion graphs corresponding to intervals of the real line — the interval graphs — and 
proved the well known characterization, that a graph G is an interval graph if 
and only if it is chordal and AT-free. Already in this early paper Lekkerkerker 
and Boland consider the problem of deciding whether a given graph contains 
an asteroidal triple. In fact, they gave a simple O(n^) algorithm — we call it 
Straightforward Algorithm in the following — and used it for recognizing 
interval graphs. Of course by now there are much faster algorithms for recogni- 
zing interval graphs. However, for deciding whether a given graph contains an 
asteroidal triple no faster algorithm had been known. 

In this paper we study the recognition problem of AT-free graphs from diffe- 
rent perspectives and present three different recognition algorithms for AT-free 
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graphs. At first we examine the above-mentioned Straightforward Algo- 
rithm a bit closer and design an algorithm that runs in 0{nm -I- n^), where 
m is the number of non-edges of the input graph G. For the second recogni- 
tion algorithm we use an algorithm for listing all triangles of a given graph and 
achieve a time bound of + ri^). Finally, in the last section we present 

the Knotting Graph Algorithm. It makes use of a characterization of AT- 
free graphs by the help of the knotting graph and recognizes AT-free graphs in 
0{nm + 

Since AT-free graphs are defined as a generalization of interval graphs it 
seems to be plausible to try similar methods for recognizing AT-free graphs as 
proved useful for recognizing interval graphs. However, for different reasons non 
of the fast interval graph recognition algorithms seems to help for our purpose: 

Booth and Lueker [2] designed the first linear time recognition algorithm for 
interval graphs making use of the vertex-maximal clique matrix of the input 
graph. A vertex-maximal clique matrix is a 0 — 1 matrix M, such that each row 
of the matrix corresponds to a vertex of the graph and each column corresponds 
to a maximal clique of the graph and an entry rriij is 1 if and only if vertex i 
is contained in the maximal clique j. Fulkerson et al. [6] showed, that a graph 
G is an interval graph if and only if the vertex-maximal clique matrix M of G 
has the consecutive ones property for rows, i.e., if there is a permutation of the 
columns of M, such that no ones in a single row are separated by zeroes in that 
same row. For AT-free graphs we do not have such a strong characterization. 
Of course, there is a close relationship between AT-free graphs and interval 
graphs, since every interval graph is AT-free and every minimal triangulation 
of an AT-free graph is an interval graph [15]. By Parra’s characterization of 
minimal triangulations (see [16]) there is a one-one correspondence between the 
minimal triangulations of a given graph G and the inclusion maximal sets of 
pairwise parallel minimal separators of G. For AT-free graphs, this implies that 
for each set of pairwise parallel minimal separators the corresponding vertex- 
minimal separator matrix has the consecutive ones property for rows. However, 
this does not give a characterization of AT-free graphs yet and, even worse, the 
number of minimal separators of an AT-free graph can be quite large — it can 
even be exponential in the number of vertices of G. Hence, it is not very likely 
that one can use a method which is similar to the consecutive ones testing for 
the recognition of AT-free graphs. 

The algorithm of Booth and Lueker can be interpreted also as one that makes 
use of the geometric model of interval graphs, since a vertex-maximal clique 
matrix that has the consecutive ones property for rows provides an interval 
model of the corresponding graph. Again, for AT-free graphs such an approach 
is not applicable, since there is no known geometric model for AT-free graphs. 

A different method for recognizing interval graphs, was recently suggested 
by Cornell et al. [3] by applying a simple four-sweep LBFS-algorithm for this 
problem. They make especially use of the characterization of interval graphs to 
be chordal and AT-free, where the first of these properties was known to be 
checkable using LBFS in linear time before (see [17]). The key property that 
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Corneil et al. make use of, is the existence of a so-called interval ordering, i.e., a 
linear ordering vi, ... , of the vertices of the graph with the property that for 
each edge Vj) € E with i < j, all vertices Vk with i < k < j are adjacent to Vj. 
This ordering characterizes interval graphs (see [5] ) . A couple of nice properties 
of LBFS are known for AT-free graphs as well (see [4]) and several researchers 
have considered the problem of recognizing AT-free graphs by the help of LBFS 
or at least similar methods like LBFS. However, up to now no fast algorithm 
using this approach is known. One of the main reasons for the difficulty of any 
such method is, that there is no known linear ordering that characterizes graphs 
without asteroidal triples. 

Thus, none of the mentioned approaches seems to be applicable for reco- 
gnizing AT-free graphs. In spite of this dejecting observation there are, in fact, 
efficient methods to handle AT-free graphs as you will see in the following sec- 
tions. But all three presented recognition algorithms for AT-free graphs that we 
give here do not achieve a linear time bound. Recently, Spinrad showed, that it is 
rather unlikely to find a much faster algorithm to recognize AT-free graphs. He 
gave a construction for comparing the complexity of the recognition of asteroidal 
triple-free graphs to the complexity of finding an independent triple and showed: 
Given an algorithm that recognizes AT-free graphs in 0(/(n, m)) then there is 
an 0{f{n,m) + n^) algorithm for deciding whether a given graph contains an 
independent triple. (Due to space restrictions we cannot give this construction 
here but refer the reader to [18].) Hence it is unlikely to find algorithms for re- 
cognizing AT-free graphs which are much faster than the known algorithms for 
finding triangles. By a similar argument as used in Spinrad’s construction. Hem- 
pel and Kratsch [10] showed that already the recognition of claw-free AT-free 
graphs is as hard as finding an independent triple in a given graph. The fastest 
known algorithm for finding triangles in a graph is matrix multiplication with a 
time bound of 0(n“), with a < 2.376. Hempel and Kratsch also gave an 0{n°‘) 
algorithm for recognizing this restricted subclass of AT-free graphs. 

In the following we will denote the set of all neighbors of a vertex u in a graph 
G by Nc{v) and Nc[v] = Na{v) U {?;} (if no ambiguities are possible we omit 
the subscript). For a set of vertices S' of a graph G we denote by G[S] the graph 
induces in G by the vertices of S and we denote with G — S the graph G[K \ S]. 
For a graph G we use n for the number of vertices and m for the number of 
edges; m is used for the number of edges of G, the complementary graph of G. 

Because of space restrictions some of the results in this extended abstract are 
stated without proof; the interested reader can find the omitted parts in [12]. 



2 Straightforward Algorithm and Its Improvement 

There are several characterizations for asteroidal triple-free graphs and, as we 
will examine in the course of this paper, some of them are more useful for the 
recognition of this graph class than others. A very simple characterization is 
already given by a slight alteration of the definition of AT-free graphs. For a 
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graph G and vertices v, w of G, let G'’{w) be the connected component of 
G — N[w] containing vertex w. 

Observation 1. An independent triple u, v, w of a graph G forms an asteroidal 
triple of G if and only if G^{u) = G'"{w) and C“(w) = G'^{w) and C^(v) = 
C^(u) holds. 

If such a triple {u, v, ic} exists we know that u and w are in the same connec- 
ted component of G — N[w], in other words there is an m, ic-path, that avoids 
the neighborhood of v. Analogously, there is a w,r(;-path avoiding the neighbor- 
hood of u and an M,w-path avoiding the neighborhood of w. Thus, u,v,w is an 
asteroidal triple of G. 

This straightforward characterization immediately implies a Straightfor- 
ward Algorithm; it was first suggested by Lekkerkerker and Boland in [13] 
when they constructed an 0(n"^) algorithm for recognizing interval graphs. In a 
first step, for each vertex v the connected components of G — N[t] are determined 
and each is assigned a different label. Then, in the second step, for each triple 
of vertices it is checked whether the condition of Observation 1 is fulfilled. 

For implementing this algorithm one can use a simple data structure called 
eomponent structure which we define here for later use. For a given graph G = 
{V, E) with n vertices, the component structure of G is an n x n matrix G, where 
each column and each row of G corresponds to a vertex of G. For each vertex 
V the matrix entry Cyw is 0, if ic G N[u], otherwise is set to the label of the 
connected component of G — N[u] containing w. 
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Fig. 1. Example of a graph together with its component structure 



Example 1. Let G be the 7-vertex graph of Figure 1. The corresponding com- 
ponent structure is given in the table. The labels of the connected components 
are the letters a to k. Note that a component that occurs for different vertices 
gets different labels; e.g. the component containing only vertex 4 occurs both for 
vertex 3 and vertex 6 and thus has both label d and label i. 

The first part of the Straightforward Algorithm can be implemented 
to run in 0{nm) by applying an 0(n -I- m) breadth-first search to G — N[w] 
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for each vertex v of G. Using the component structure, the second part of the 
algorithm can easily be implemented to run in O(n^), by checking for each triple 
u,v,w of G whether c^v = c„m, c„„ = and = Cwv- Hence, AT-free graphs 
can be recognized in O(n^) using the Straightforward Algorithm. 

Obviously, the “bottleneck” of the complexity of the Straightforward 
Algorithm seems to be the checking of all possible triples of the given graph, 
and it is a reasonable question to ask, whether all those tests are indeed required. 
Actually, it is not really necessary to do this for all triples of the graph, because, 
when searching for asteroidal triples, we are interested only in independent tri- 
ples. Thus, as a first step, it is sufficient to check for all non-edges {v, w) of G, 
whether there is a vertex u such that the triple u, v, w is an asteroidal triple. Ob- 
viously, there are 0{nm) those triples in G, where m is the number of non-edges 
of G. A look back to the complexity of the first part of the Straightforward 
Algorithm tells us that we did not really get a better time complexity yet, 
since 0{nm) + 0{nWi) is still O(n^). However, also this first part of the algo- 
rithm can be altered to run in 0{nm), leading to an 0{tt? + nm) algorithm, as 
we show in the following. 

The method that we use for this improvement is a BFS conducted on G. 
McConnell [14] observed, that one can implement a BFS in such a way that it 
constructs the BFS-layers for G and runs in 0(n -I- m) time, where m is the 
number of edges of G — not of G (see also [11]). 

To achieve a BFS ordering on the complement by the help of partition refi- 
nement, we just have to change the ordering of T\ and T 2 , i.e., T 2 is placed in 
front of T\ in the ordering of the sets of the partition. 

Putting together the BFS on G and the improved checking of triples, leads 
to the Complement Algorithm. 

Theorem 1. Recognition of AT-free graphs, using the Complement Algo- 
rithm, takes 0{n^ nfn) time, where fn is the number of non-edges ofG. 

Proof. Computing the complement of a graph G can, of course, be done in O(n^). 
To compute the component structure, by the help of the complement-BFS takes 
0{n-\-m) for each vertex; hence, for all vertices it can be done in 0{nm). Finally, 
checking for each non-edges (v,w) and each vertex u, whether u,v,w form an 
AT, takes 0{nm) as well. □ 

3 Triangle Algorithm 

Before stating the next algorithm for recognizing AT-free graphs we first consider 
a different problem, the problem of finding triangles in a given graph. As we will 
see later, it turns out to be closely related to our recognition problem. Here 
we have to distinguish between the problem of deciding whether a given graph 
contains a triangle and the problem of finding one or all triangles of G. For the 
beginning we are interested in the second of these problems, i.e., we want to list 
all triangles of a given graph G. For this we make use of an observation made 
by Gabow, concerning the number of different triangles in a graph G, i.e., the 
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number of triangles of G, such that each pair of those triangles differs in at least 
one vertex. 

Lemma 1 (Gabow [7]). Given a graph G with m edges, there are at most 
different triangles. 

For listing all triangles of G in time, we can use the following algo- 

rithm. First the vertices are ordered, according to their degree. Then for each 
vertex Vi a list L„. of all neighbors of Vi with equal or larger degree that occur 
after Vi in the ordering is created. Within this list the vertices are ordered ac- 
cording to the ordering that was determined in the first step. In the last part 
of the algorithm for each vertex Vi the corresponding list L„. is traversed and 
for each vertex Vj in the lists Ly. and Ly^ are compared. For each vertex Vk 
that is contained in both lists the algorithm outputs the corresponding triangle 

Vi, Vj , U/p. 

Lemma 2. For a given graph G a complete list of all different triangles of G 
can be determined in 0(rr?l’^ + n). 

Similarly as in Lemma 1, one can bound the number of different triangles of 
a graph G also by the help of L\(G), the maximum vertex degree of G. 

Lemma 3. In a graph G with m edges and maximum degree A{G) there are at 
most 0{A{G) m) different triangles. 

Of course, this bound can also be used for the algorithm, that lists all triangles 
of G. 

Corollary 1. For a given graph G a complete list of all different triangles of G 
can be determined in 0{A{G) m). 

In Section 2, exploring the structure of the complement of a graph G seemed 
to be helpful for deciding whether G contains asteroidal triples. In the following 
algorithm — the Triangle Algorithm — we consider again G, this time using 
the set of its triangles. 

Theorem 2. Recognition of AT -free graphs, using the Triangle Algorithm 
(Algorithm 1), takes -I- n^) time, where n is the number of vertices and 

Wi is the number of non-edges ofG. 

Using the results of Lemma 1 and Lemma 3, we can also give a recognition 
algorithm that runs in 0{A(G)rh-\-n‘^), where A(G) is the maximum degree of 
G. Hence we have the following corollary. 

Corollary 2. Recognition of AT- free graphs, using the Triangle Algorithm, 
takes 0(n^ -I- min{m^/^, Z\(G)} m) time, where n is the number of vertices, m is 
the number of non-edges of G and A{G) the maximum degree ofG. 
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Algorithm 1: Triangle Algorithm 

begin 

Compute G; 

Determine a list L of all different triangles of G; 

for V G G do 

find all edges of G[Nq(u)] using L; 

compute connected components of G[Nq(u)] = G — N[u]; 
store labels of components in component structure G; 

end 

for each triangle u, v, w of L do 
I check whether u, v, w is an AT; 

end 

end 



Remark 1. As part of the Triangle Algorithm we used an algorithm that 
lists all triangles of the complement of G in a certain time and then runs in the 
order of the size of the set of triangles (plus O(n^) for initializing the component 
structure). Thus, if we are given the set of all independent triples of G and this 
set has size |T|, then we can check in 0(|T| + n^) time whether G is AT-free. 
If we take, for example, a graph G such that G is planar, we can find the list 
of triangles of G in O(n^), implying that we can decide in O(n^) whether G is 
AT-free. 

The time bounds of both the Triangle Algorithm and the Complement 
Algorithm are measured in the size of G. Another way to look at these algo- 
rithms is to consider them to be algorithms to recognize coAT-free graphs. In 
other words, we have an 0(nm), an + n^) and an 0{A{G) m + n^) 

algorithm which decides the problem: Given a graph G, is G a coAT-free graph? 

For recognizing AT-free graphs the term for the above algorithms was not 
avoidable, since both the edges and the non-edges of G were used during the 
algorithms and for every graph G, either m or m is in the order of . For reco- 
gnizing coAT-free graphs we can do better. All we have to consider are the edges 
of G whereas the non-edges of G are not of any interest. The way the algorithm 
is given above, we need O(n^) for initializing the component structure G. We can 
get rid of this by the following method: Instead of storing the information about 
the connected components of the neighborhood in the component structure, we 
store this information “within” the edges of the graph. For a vertex v we store 
the label of the connected component of G[Ng[w]] containing some vertex u to- 
gether with the edge (u, u). Obviously, for every edge only two labels are stored. 
To check, whether there is an asteroidal triple in G we just have to check for each 
triangle of G, whether the edges of the triangle have pairwise the same label for 
their common incident vertex. Consequently, we have the following theorem. 

Theorem 3. For coAT-free graphs the recognition problem can he solved in 
0(min{m^/^, Z\(G)} m) time, where m is the number of edges and A{G) the 
maximum degree of G. 
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4 The Knotting Graph 

When studying AT-free graphs one learns to appreciate the strong relationship 
between AT-free and comparability graphs. It was proved by Golumbic, Monma, 
and Trotter in their 1984 paper on tolerance graphs [9], that a graph G is a 
cocomparability graph then G contains no asteroidal triple. In fact, they were 
not the first to realize this relationship. A closer look at the paper of Gallai [8] 
on transitively orientable graphs reveals, that he already achieved this result 
almost twenty years earlier. In fact, he proves a much stronger result. To state 
his theorem we have to define one more concept. 

Definition 1. The sequence a = x\, Pi,X 2 ,P 2 , . . . , X 2 k+i, P 2 k+i,x\, {k > 1) 

is said to he a (2 fc-|-l) -asteroid of a graph G, ifxi , . . . , X 2 k+i are different vertices 
of G and Pi are Xi,Xi+i~paths ofG, such that for each Xi (1 < i < 2k + 1) there 
is no neighbor of Xi on the path Pi+k where the paths Pa and P^ are assumed to 
be equal for a = j3 (mod 2k + 1). 

Not surprisingly, every 3-asteroid contains an asteroidal triple, as one can 
derive from the definition. Examples of {2k + l)-asteroids for k > 2 are all 
complements of chord-less 2fc-|-l cycles for k > 2. A complete list of all irreducible 
{2k + l)-asteroids for fc > 2 which do not contain any asteroid of smaller length 
was given in [8]. Now we are able to state the result of Gallai. 

Theorem 4 (Gallai [8]). A graph G is a comparability graph if and only if G 
does not contain a {2k + l)-asteroid for k > 1. 

For the proof of his result Gallai makes use of a certain structure called the 
knotting graph ( Verkniipfungs graph) of G. 

Definition 2. For a given graph G = {V, E) the corresponding knotting graph 
is given by K[G] = {Vk, Ek) where Vk and Ek are defined as follows. For each 
vertex v of G there are copies vi,V 2 , . . . ,Vi^ in Vk, where iy is the number of 
connected components ofN{v), the complement of the graph induced by N(u). 
For each edge {v,w) of E there is an edge {vi,Wj) in Ek, where w is contained 
in the connected component ofN{v) and v is contained in the connected 
component o/N('u;). 



Example 2. In Figure 2 one can see a graph G together with its knotting graph. 
Here small dots in the knotting graph that are drawn closely together indicate 
that they are copies for the same original vertex of the graph. 

For the purpose of characterizing comparability graphs, i.e., the class of gra- 
phs which have a transitive orientation, the knotting graph has special impor- 
tance, as shown in the following theorem. 

Theorem 5 (Gallai [8]). A graph G is transitively orientable if and only if 
K[G] is bipartite. 
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Fig. 2. Example of a graph, together with its knotting graph. 



What makes this graph interesting for us is the close connection between 
the asteroidal sets of a graph G (a generalization of asteroidal triples) and the 
cliques of the knotting graph of G. Before we can state this result we first have 
to define the concept of an asteroidal set and the asteroidal number of a graph 
G. 

Definition 3. For a given graph G, an independent set of vertices S is called 
asteroidal set if for each x € S the set S — {x} is in one connected component 
of the graph G — N[x]. The asteroidal number of a graph G is defined as the 
maximum cardinality of an asteroidal set ofG, and is denoted by an(G). 

Theorem 6. Let G be a graph, then an(G) = w(K[G]). 

Proof. Let an(G) = k and let A = {ai, . . . ,ak} be an asteroidal set of G. By 
the definition of asteroidal sets the vertices of A are pairwise independent. Con- 
sequently, A induces a clique in G and for each at £ A the set T \ {ui} is 
contained in the neighborhood of Oi in G. Since A is an asteroidal set, for each 
aj,Uk £ ^ \ {fli} (j yf k) there is an a^, a^-path in G that avoids the neigh- 
borhood of Oi- Therefore aj and Ok are in the same connected component of 
G — N[oi]. By the definition of the knotting graph this implies that the knotting 
graph edges corresponding to the edges (oi, aj), (oj, Ofc) of G are incident to the 
same copy of Oi in the knotting graph. Since this is true for all pairs of vertices 
in A \ {oj}, all edges corresponding to edges from vertices of A \ {oi} to at in G, 
are incident to the same copy of at in the knotting graph. Consequently, there 
is a fc-clique in K[G] formed by copies of vertices of A. 

Now suppose there is a /c-clique in the knotting graph K[G]. Since there is a 
1-1 correspondence between the edges of G and the edges of K[G] there is a set 
A = {«!, . . . ,afc} of k vertices of G corresponding to the vertices of the clique 
in K[G]. By the definition of the knotting graph, for each vertex Oi £ A the 
vertices of A\{ai} are contained in the same connected component of G — N[oi]. 
Consequently, A is an asteroidal set of G. □ 

For AT-free graphs we can draw the following corollary. 

Corollary 3. A graph G is asteroidal triple-free if and only z/K[G] is triangle- 
free. 
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Because of the close relationship between an AT-free graph G and the knot- 
ting graph K[G] of the complementary graph G, we will sometimes call K[G] the 
knotting graph corresponding to the AT-free graph G. 

At this point we would like to mention some questions that arise from con- 
sidering {2k + l)-asteroids. As we have seen, AT-free graphs are exactly those 
graphs, that do not contain a 3-asteroid. On the other hand, cocomparability 
graphs are those graphs that do not contain any {2k + l)-asteroid. It seems to 
be an interesting question to consider those graph classes, that are defined by 
forbidding only certain kinds of asteroids. For example one could develop a whole 
hierarchy of graph classes that are superclasses of cocomparability graphs and 
subclasses of AT-free graphs. 

Another way of generalizing this concept is to consider the definition of 
asteroids again. This definition has the unsatisfying property that it exists only 
for odd numbers. Thus, it seems to be natural to ask whether one can state 
a reasonable definition also for even asteroids. The definition of Gallai implies, 
that an asteroid of a given graph G corresponds to a cycle in the knotting graph 
of G. This can be used for our purpose in the following way. We define a k- 
asteroid as a sequence a = xi, ... ,Xk {k > 3) of different vertices of G, such 
that for each vertex Xi there is a path between Xi_i and Xi+i in G — N[xj]. This, 
in fact, does cover the definition of odd asteroids, although the new numbering 
is different to the one used in the definition of Gallai. We leave it as an open 
question to characterize the graphs that are characterized by forbidding certain 
(not necessarily odd) asteroids. 

There are a couple of other properties of the knotting graph, especially with 
respect to AT-free graphs. For further results the reader is refered to [12]. 

5 The Knotting Graph Algorithm 

Before we can state the Knotting Graph Algorithm, we first consider the 
relationship between the knotting graph, corresponding to a graph G and the 
component structure of this graph. 

Lemma 4. Let G = (V, E) be a graph and G the corresponding component 
structure, then K[G] = {Vk,Ek) is the knotting graph of G, with 

Vk = {cvw : 3-y,r(; G K} \ {0} 

Ek = {{cvw,Cwv) ■ ^v,w G V such that ^ 0 and c^v 0}. 

Now we are ready to present the Knotting Graph Algorithm (see Algo- 
rithm 2). 

As shown in Section 2, the component structure of G can be computed in 
0{nm) time. By Lemma 4, Vk and Ek, the vertex and edge set of K[G], can 
be determined by the help of the component structure. To compute Vk one has 
to scan through all rows r of the component structure G and for each new label 
one inserts a new copy of the vertex corresponding to row r. Hence, finding Vk 
takes 0{n^). The edge set Ek can be computed by checking for each possible 
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Algorithm 2: Knotting Graph Algorithm 
begin 

for V G G do 

compute connected components of H, using BPS; 

store labels of components in component structure C{iv, •); 

end 

Vk = {cvvj ■ 3v,w G V such that Cvw ^ 0}; 

Ek — {(cvw,Cwv) ■ 3u, w G G such that Cvw 0 and c-wv ^ 0}; 
check whether K[G] contains a triangle; 

end 



row-column pair r,c oi C whether both Crc and Ccr are non-zero. Since there are 
those pairs for our n x n matrix C , this can be done in 0{n^) as well. 

The difficult part of the algorithm is finding triangles in K[G]. The fastest 
known algorithm for this problem is matrix multiplication, which runs in 0(n“). 
Unfortunately, we cannot use this algorithm for our problem. The reason is, that 
the number of vertices of K[G] can be considerably larger than the number of 
vertices of G. There are examples of graphs, showing that there can be 
vertices in K[G], where n is the number of vertices of G. One information that 
we do have about the number of vertices of K[G] is, that there are not more than 
twice as many vertices as edges in K[G], i.e., \Vk\ < 2 \Ex\ (an exception is, of 
course, the case that there are universal vertices in G, but for finding triangles 
in K[G] this case is not of interest). Hence, if we apply an algorithm, that runs in 
0(/(|if/c|)), it is in fact an 0(/(n^)) algorithm, since the number of edges of K[G] 
is equal to the number of edges of G. For the case that the input graph is sparse, 
Alon et al. [1] suggested an improvement of the matrix multiplication algorithm 
for finding a triangle. The complexity of this algorithm is 0 (to“+t); for a < 2.376 
this is Consequently, if we apply the triangle algorithm for sparse 

graphs of Alon et ah, we can decide in 0(m“+T) = 0(n“+r) = 0(mn-|-n^ ®^®) for 
a < 2.376 whether K[G] contains a triangle. Thus we have shown the following 
Theorem. 

Theorem 7. Recognition of AT-free graphs using the Knotting Graph Al- 
gorithm takes 0{mn + time. 



Remark 2. It is an open question, whether there is a certificate for a graph to 
be AT-free, which can be checked in less than O(n^). The knotting graph does 
provide some kind of “partial certificate”, since it can indeed be checked in less 
than O(n^) for containing a triangle. Of course, for being a proper certificate 
one should be able to check also within this time bound whether the knotting 
graph is correctly determined. This can currently be done in 0{nm), implying 
that we have a fast checkable certificate for sparse graphs. 
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Abstract. Several practical instances of network design problems re- 
quire the network to satisfy multiple constraints. In this paper, we ad- 
dress the Budget Constrained Conneeted Median Problem: We are given 
an undirected graph G = {V, E) with two different edge-weight functions 
c (modeling the construction or communication cost) and d (modeling 
the service distance), and a bound B on the total service distance. The 
goal is to find a subtree T of G with minimum c-cost c{T) subject to 
the constraint that the sum of the service distances of all the remaining 
nodes v £V\T to their closest neighbor in T does not exceed the speci- 
fied budget B. This problem has applications in optical network design 
and the efficient maintenance of distributed databases. 

We formulate this problem as bicriteria network design problem, and 
present bicriteria approximation algorithms. We also prove lower bo- 
unds on the approximability of the problem that demonstrate that our 
performance ratios are close to best possible. 



1 Introduction and Overview 

The problem of interfacing optic and electronic networks has become an im- 
portant problem in telecommunication network design [20,21]. As an example, 
consider the following problem: Given a set of sites in a network, we wish to 
select a subset of the sites at which to place optoelectronic switches and routers. 
The backbone sites should be connected together using fiber-optic links in a mi- 
nimum cost tree, while the end users are connected to the backbone via direct 
links. The major requirement is that the total access cost for the users be within 
a specified bound, whereas the construction cost of the backbone network should 
be minimized. 
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Problems of similar nature arise in the efficient maintenance of distributed 
databases [22,4,3,15,7]. Other applications of the Budget Constrained Connected 
Median Problem studied in this paper include location theory and manufacturing 
logistics (see [20,21] and the references cited therein). 

The above problems can be cast in a graph theoretic framework as follows: 
Given an undirected graph G = {V, E) with two different edge-weight functions 
c (modeling the construction cost of the backbone/inter-dat abase links) and d 
(modeling the service distance), the goal is to find a subtree T of G with minimum 
c-cost c{T) subject to the constraint that the sum of the service distances of all 
the remaining nodes v G V \ T to their closest neighbor in T does not exceed a 
specified budget B. 

We study the approximability of the Budget Constrained Connected Median 
Problem. This paper is organized as follows. In Section 2 we formally define 
the problem under study and the notion of bicriteria approximation. Section 3 
contains a brief summary of the main results in the paper and a discussion of 
related work. In Section 4 we prove hardness results. Section 5 contains a fully 
polynomial approximation scheme on trees. An approximation algorithm for the 
general case is presented in Section 6. 

2 Problem Definition and Preliminaries 

Throughout the paper G = (V, E) denotes a finite connected undirected graph 
with n := \V\ vertices and m := \E\ edges. The Budget Constrained Connec- 
ted Median Problem (Bccmed) problem considered in this paper is defined as 
follows: 

Definition 1 (Budget Constrained Connected Median Problem). An 

instance consists of an undirected graph G = (V,E) with two different edge-cost 
functions c (modeling the construction or communication cost) and d (modeling 
the service distance), and a bound B on the total service distance. The problem 
is to find a subtree T of G of minimum cost c{T) := Yhe(^T subject to the 
constraint that the total service distance of each of the vertices from V is at 
most B, that is, 



mediand{T) \— 



v^V 



rCm distdiv.u) < B. 

.GT 



(where distd{v,u) denotes the shortest path distance between vertices v and u 
with respect to the edge- cost function d). 

The problem Bccmed can be formulated within the framework developed 
in [18,13]. A generic bicriteria network design problem, (A,B,S), is defined by 
identifying two minimization objectives, - A and B, - from a set of possible 
objectives, and specifying a membership requirement in a class of subgraphs, - 
S. The problem specifies a budget value on the first objective. A, and seeks to 
find a network having minimum possible value for the second objective, B, such 
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that this network is within the budget on the first objective A. The solution 
network must belong to the subgraph-class S. In this framework Bccmed is 
stated as (total d-service distance, total c-edge cost, subtree). : the budgeted 
objective A is the total service distance mediand(T) with respect to the edge 
weights specified by d, the cost-minimization objective B is the total c-cost of 
the edges in the solution subgraph which is required to be a subtree of the 
original network. 

Definition 2 (Bicriteria Approximation Algorithm). A polynomial time 
algorithm for a hicriteria problem {A,B, S) is said to have performance (a,/3), 
if it has the following property: For any instance of (A,B,S), the algorithm 
produces a solution from the subgraph class S for which the value of objective A 
is at most a times the specified budget and the value of objective B is at most (3 
times the minimum value of a solution from S that satisfies the budget constraint. 

Notice that a “standard” c-approximation algorithm is a (1, c)-bicriteria ap- 
proximation algorithm. A family {Aejg of approximation algorithms, is called a 
fully polynomial approximation scheme or FPAS, if algorithm is a (1, 1 -I- e)- 
approximation algorithm and its running time is polynomial in the size of the 
input and 1/e. 



3 Summary of Results and Related Work 

In this paper, we study the complexity and approximability of the problem 
Bccmed. Our main results include the following: 

1. Bccmed is NP-hard even on trees. This result continues to hold even if the 
edge- weight functions c and d are identical. We strengthen this hardness 
result to obtain strong NP-hardness results for bipartite graphs. 

2. We strengthen the above hardness results for general graphs further and 
show that unless NP C DTIME(A*°si°g^)^ there can be no polynomial time 
approximation algorithm for Bccmed with a performance (1, (1/20 — e) In n), 
where n denotes teh number of vertices in the input graphs. 

Our hardness results are complemented by the following approximation re- 
sults: 

1. There exists a FPAS for Bccmed on trees. 

2. For any fixed e > 0 there exists a (l-Pe, (1 -I- l/e)0(log^ n log log nil-approxi- 
mation algorithm for Bccmed on general graphs. 



3.1 Relationship to the Traveling Pnrchaser Problem 

The Bccmed problem is closely related to a well studied variant of the classical 
traveling salesperson problem called the Traveling Purchaser Problem (see [20] 
and the references therein) . In this problem we are given a bipartite graph G = 
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(M UP,E), where M denotes a set of markets and P denotes the set of products. 
There is a (metric) cost Cij to travel from market i to market j. An edge between 
market i and product p with cost dip denotes the cost of purchasing product p 
at market i. A tour consists of starting at a specified market visiting a subset of 
market nodes, thereby purchasing all the products and returning to the starting 
location. The cost of the tour is the sum of the travel costs used between markets 
and the cost of buying each of the products. The budgeted version of this problem 
as formulated by Ravi and Salman [20] aims at finding a minimum cost tour 
subject to a budget constraint on the purchasing costs. 

It is easy to see that a {a, /3)-approximation algorithm for the budgeted trave- 
ling purchaser problem implies a (a, 2/3)-approximation for Bccmed: just delete 
one edge of the tour to obtain a tree. Using the (H-e, (H-l/e)0(log^mloglog m))- 
approximation algorithm from [20] we get a (H-e, 2(H-l/e)0(log^mloglogm))- 
approximation for Bccmed. Our algorithm given in Section 6 uses the techniques 
from [20] directly and improves this result. 

3.2 Related Work 

Other service-constrained minimum cost network problems have been considered 
in [1,6,12,17,16]. These papers consider the variant that prescribes a budget on 
the service distance for each node not in the tree. The goal is to find a minimum 
length salesperson tour (or a tree as may be the case) so that all the (customer) 
nodes are strictly serviced. Restrictions of the problems to geometric instances 
were considered in [1,12,19]. Finally, the problem Bccmed can be seen as a 
generalization of the classical k-Median Problem, where we require the set of 
medians to be connected. 

4 Hardness Results 

Theorem 1. The problem Bccmed is NP -hard even on trees. This result con- 
tinues to hold even if we require the two cost functions c and d to coincide. 

Proof. We use a reduction from the Partition problem, which is well known 
to be NP-complete [10, Problem SP12]. Given a multiset of (not necessarily 
distinct) positive integers {oi,... ,an}, the question is whether there exists a 
subset U C {1, . . . , n} such that “*■ 

Given any instance of Partition we construct a star-shaped graph G having 
n-\-l nodes {x, xi, . . . , Xn} and n edges (x, Xi), i = 1, . . . ,n. We define c(x, Xi) := 
d{x,Xi) := Oi. Let D := We set the budget for the median cost of the 

tree to be B := D/2. It is easy to see that there exists a feasible tree T of 
cost c(T) at most D/2 if and only if the instance of Partition has a solution. 

□ 

Next, we prove our inapproximability results for general graphs. Before sta- 
ting the hardness result we recall the definition of the Min Set Cover problem 
[10, Problem SP5] and cite the hardness results from [9,2] about the hardness of 
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approximating Min Set Cover. An instance (U,S) of Min Set Cover con- 
sists of a finite set U of ground elements and a family S of subsets of U. The 
objective is to find a subcollection C C 5 of minimum size |5| which contains all 
the ground elements. 

Theorem 2 (Feige [9]). Unless NP C DTIME(iV®(*°s^°s^)), for any e > 0 
there is no approximation algorithm for Min Set Cover with a performance 
of (1 — e) In \U\, where U is the set of ground elements. □ 

Theorem 3 (Arora and Sudan [2]). There exists a constant rj > 0 such that, 
unless P= NP, there is no approximation algorithm for Min Set Cover with 
a performance of lyln |[/|, where U is the set of ground elements. □ 

We are now ready to prove the result about the inapproximability of Boomed 
on general graphs. 

Theorem 4. The problem Boomed is strongly NP -hard even on bipartite 
graphs. If there exists an approximation algorithm for Boomed on bipartite 
graphs with performance a(|F|) € 0(ln\V\), then there exists an approximation 
algorithm for Min Set Cover with performance 2o;(2|17| -I- 2|5|). All results 
continue to hold even if we require the two cost functions c and d to coincide. 

Proof. Let (U,S) be an instance of Min Set Cover. We assume without loss 
of generality that the minimum size set cover for this instance contains at least 
two sets (implying also that \U\ > 2). 

For each k G {2, . . . ,n} we construct an instance Ik of Boomed as follows: 
The bipartite graph constructed for instance Ik has \Vk\ = 2|17|-|-2|5|-|-2 — 
k < 2{\U\ |5|) vertices. First construct the natural bipartite graph with node 

set U yj S. We add an edge between an element node u G U and a set node 
S' € 5 if and only if u G S. We now add a root node r which is connected via 
edges to all the set nodes from S. Finally, we add a set Lfc of |C/| -I- |S| — A: -I- 1 
nodes which are connected to the root node via the edges (l,r), I G Lk. Let 
X := fcfadVfel)] -I- 1. The edges between element nodes and set nodes have 
weight X, all other edges have weight 1. The budget on the median cost for 
instance Ik is set to Bk := \Lk\ -P X\U\ -P |S| — k. 

As noted above, the bipartite graph constructed for instance Ik has |Vfc| = 
2\U\ + 2|S| +2-k< 2{\U\ + |S|) vertices. Thus, a(|Vfc|) < a{2\U\ + 2|S|). 

The main goal of the proof is to show that (i) if there exists a set cover of 
size k, then instance Ik of Bccmed has a solution with value at most k; (ii) any 
feasible solution for instance Ik of Bccmed with cost C < a{\Vk\)k can be used 
to obtain a set cover of size at most 2C. Using these two properties of the reduc- 
tion, we can show that any o;(|U| (-approximation to Bccmed transforms into 
a o;(2|17| -I- 2 1 5 1 (-approximation for Min Set Cover: Find the minimum va- 
lue k* G {1, . . . , n} such that the hypothetical a-approximation algorithm A for 
Bccmed outputs a solution of cost at most a{\Vk\)k* for instance Ik*. By pro- 
perty (i( and the performance of A it follows that k* is no greater than the opti- 
mum size set cover. By property (ii( we get a set cover of size at most 2o;(|Vfe. |(fc* 
which is at most 2o;(2|17| -I- 2|5|( times the optimum size cover. 
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We first prove (i). Any set cover C of size k can be used to obtain a tree by 
choosing the subgraph induced by the set nodes corresponding to the sets in C 
and the root node r. Clearly, the cost of the tree is k. Since the sets in C form a 
cover, each element node is within distance of X from a vertex in the tree. Thus, 
the total median-cost of T is no more than X\U\ -I- |5| — fc -I- \Lk\ = Bk- 

We now address (ii). Assume conversely, that T is a solution for Ik with 
value C, i.e., a tree with mediand(T) < Bk and c{T) = C < a(|Vfc|)^- We first 
show that the root node r must be contained in the tree. 

In fact, if this were not the case, then at least \Lk\ — l nodes from Lk can not 
be in T either. Moreover, these at least \Lk \ — 1 nodes are at distance at least 
two from any node in the tree. Moreover, since X > C -I- 1, the tree T can not 
contain any edge between set nodes and element nodes. Thus, T consists either 
of a single element node or T does not contain any element nodes. In the first 
case, the median cost of T is at least 

2X{\U\ -1) + 3|Lfc| = Bk + {\U\ - 2)X + \U\ + \S\ - k + 2 > Bk, 

which contradicts that T is feasible. In the second case, (T does not contain 
element nodes) either T consists of a single node from Lk or does not contain 
any node from Lk- Thus, we get that the median cost of T is at least 

A|[/| -|- 2(|Lfc| — 1) -|-l-|-2 = A|[/| -|- \Lk \ -l-|C/|-|-|iS|— A:-|-l = Bk + 1 > Bk, 

which is again a contradiction. (The additive terms in the above calculation 
stem from the fact that r is not in the tree and a set node or the remaining node 
from Lk is at distance at least two from the tree). 

We now show that the collection C of set nodes spanned by the tree T can 
be used to obtain a cover of size at most 2C. Let C/c C [/ be the subset of 
element nodes not covered by the collection C of sets. For each element u G Uc 
its distance to any node in T is at least X-l-1. The median cost of T thus satisfies: 

mediand(T)> \Lk\ +X\U\ + \Uc\ + |5| - \T n {Lk U S)\ = Bk + k+ \Uc\ -C. (1) 

On the other hand, since T is feasible, mediand(T) < Bk, and hence we get 
from (1) that |C7c| < C — fc < C. In words, the number of elements left uncovered 
by the collection C of sets is at most the cost C of the tree T. Hence, we can 
augment C to a valid cover by adding at most \Uc\ < C sets. This leads to a set 
cover of size at most C. □ 

The instances of Min Set Cover used in [9] have the property that the 
number of sets is at most \U\^, where Lf is the ground set (see [8] for an explicit 
computation of the number of sets used). Thus from Theorem 4 we get a lower 
bound for Bccmed of (1/20 — e) In |y| (assuming that NP % DTIME(A^*°s^°S'^)). 
Since the number of sets in any instance of Min Set Cover is bounded by 2P , 
we can use the result from [2] to obtain a result under the weaker assumption 
that P yf NP: 
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Theorem 5. (i) Unless NP C Jqj. Q^jiy e > Q there can be 

no polynomial time approximation algorithm for Bccmed with a perfor- 
mance (1/20 — e) In n. 

(ii) Unless P= NP, for any e > 0 there is no approximation algorithm for Bcc- 
med with a performance of (1/8 — e) In Inn. □ 

5 Approximation Scheme on Trees 

We first consider the problem Bccmed when restricted to trees. We present 
an FPAS for a slightly more general problem than Bccmed, called generalized 
Bccmed in the following: We are additionally given a subset U Q V of the 
vertex set and the budget constraint requires that the total service distance of 
all vertices in U (instead of V) does not exceed B. 

Theorem 6. There is a FPAS for the generalized Bccmed on trees with run- 
ning time 0(log(jiC)n^ /e^) , where C denotes the maximum c-weight of an edge 
in a given instance. 

Proof. Let T = (V/ E) be the tree given in the instance I of Bccmed. We root 
the tree at an arbitrary vertex r G V. In the sequel we denote by Ty the subtree 
of T rooted at vertex v G V. So Ty = T. Without loss of generality we can 
assume that r is contained in some optimal solution / (we can run our algorithm 
for all vertices as the root vertex) . We can also assume without loss of generality 
that the rooted tree T is binary (since we can add zero cost edges and dummy 
nodes to turn it into a binary tree). 

Let T* = (V*,E*) be an optimal solution for I which contains r. Denote 
by OPT = c(T*) its cost. Define C := maxeg£;c(e) and let K € [0,nC] be an 
integral value. The value K will act as “guess value” for the optimum cost in the 
final algorithm. Notice that the optimum cost is an integer between 0 and nC . 

For a vertex v G V and an integer k G [0, K] we denote by D[v, k] the 
minimum service cost of a tree servicing all nodes in U contained in the 
subtree Ty rooted at v and which has following properties: (1) T*j, contains v, 
and (2) c(T*^,) < k. If no such tree exists, then we set D[v,k] := -Poo. Notice 
that 



c{T*) = min{ k : D[r, k] < B}. 

Let V G V he arbitrary and let vi , V2 be its children in the rooted tree T. We 
show how to compute all the values D[v,k], 1 < k < B given the values D[vi,-], 
i = 1,2. 

For i = 1, 2 let Si := YhweTy nu ^)- k then none 

of the vertices in T„. can be contained in Tf f.. Let 

Xk '■= S\ -P S2 

and 



Yfc := min{ D[vi,k'] -P D[v 2 , k"] \ k' -\- k" = k — c(v, vi) — c(v, V 2 ) }. 
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Then we have that 

D[v,k] = min{S '2 + D[vi,k- c(-y, ui)], S'! + D[v 2 ,k - c{v,vi)],Xk,Yk}. 

The first term in the last equation corresponds to the case that v\ is in T* ^ but 
not V 2 - The second term is the symmetric case when V 2 is in the tree but not v\. 
The third term concerns the case that none of Vi and V 2 is in the tree. Finally, 
the fourth term models the case that both children are contained in 

It is straightforward to see that this way all the values D[v, k], 0 < k < K can 
be computed in 0{K^) time given the values for the children vi and V 2 - Since 
the table values for each leaf of T can be computed in time 0{K), the dynamic 
programming algorithm correctly finds an optimal solution within time 0{nK^). 

Let e > 0 be a given accuracy requirement. Now consider the following test 
for a parameter M G [0, (n— l)^]: First we scale all edge costs c(e) in the graph 
by setting 



c“(e) 



(n - l)c(e) 
Me 



(2) 



We then run the dynamic programming algorithm from above with the scaled 
edge costs and K := (1 + l/e)(n— 1). We call the test successful if the algorithm 
gives the information that D[r, K] < B. Observe that the running time for one 
test is 

We now prove that the test is successful if M > OPT. To this end we have 
to show that there exists a tree of cost at most K such that its service cost is at 
most B. Recall that T* denotes an optimum solution. Since we have only scaled 
the c-weights, it follows that T* is also a feasible solution for the scaled instance 
with service cost at most R. If M > OPT we have 



H <="(«) < E 

e£T* e^T* 



( (n - l)c(e) 
V Me 




n — 1 

< ^-^ + \T*\ < 




!)• 



Hence for M > OPT, the test will be successful. We now use a binary search to 
find the minimum integer M' G [0, (n — 1)C] such that the test described above 
succeeds. Our arguments from above show that the value M' found this way 
satisfies M' < OPT. Let T' be the corresponding tree found which has service 
cost no more than B. Then 



c(e) < 

eGT' 



M'e 
n — 1 



E '"'(e) < 

eGT' 



M'e 



n — 1 




1) < (l + e)OPT. 



(3) 



Thus, the tree T' found by our algorithm has cost at most 1 + e times the 
optimum cost. The running time of the algorithm can be bounded as follows: 
We run 0(log(nC)) tests on scaled instances, each of which needs time 0(r? je^') 
time. Thus, the total running time is 0(log(nC')n^/e^), which is bounded by a 
polynomial in the input size and 1/e. □ 
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6 Approximation Algorithm on General Graphs 

In this section we use a Linear Programming relaxation in conjunction with filte- 
ring techniques (cf. [14]) to design an approximation algorithm. The techniques 
used in this section are similar to those given in [20] for the Traveling Purchaser 
Problem. The basic outline of our algorithm is as follows: 

1. Formulate the problem as an Integer Linear Program (IP). 

2. Solve the Linear Programming relaxation (LP). 

3. With the help of the optimal fractional solution define a service- cluster for 
each vertex. The goal is to service each vertex by one node from its cluster. 

4. Solve a Group Steiner Tree problem on the clusters to obtain a tree. 



Integer Linear Programming Formulation and Relaxation. In the follo- 
wing we assume again that there is one node r (the root) that must be included 
in the tree. This assumption is without loss of generality. Consider the following 
Integer Linear Program (IP) which we will show to be a relaxation of Bccmed. 
The meaning of the binary decision variables is as follows: Ze = 1 if and only 
if edge e is included in the tree; furthermore x^u, = 1 if and only if vertex w 
is serviced by v. The constraints (4) ensure that every vertex is serviced, con- 
straint (5) enforces the budget-constraint on the service distance. Inequalities (6) 
are a relaxation of the connectivity and service requirements: For each vertex w 
and each subset S which does not contain the root r either w is serviced by a 
node in P \ S' (this is expressed by the first term) or there must be a an edge 
of T crossing the cut induced by S (this is expressed by the second term) . 



(IP) min 

eeE 

Xy^o = 1 {w GV) (4) 

vev 

d{v,w)xyyj < D (5) 

vGV wGV 

Xvw Zyu>l {w gV,S cv,r ^ S) ( 6 ) 

v^S vGS,u^S 

ze G {0, 1} (e G E) (7) 

Xyyy€{0,l} {v & V , W & V) ( 8 ) 



The Linear Programming relaxation (LP) of (IP) is obtained by replacing 
the integrality constraints (7) and (8) by the constraints 2e G [0, 1] (e G E) and 
Xyy, G [0, 1] {v ev,w e V). 



Lemma 1. The relaxation (LP) of (IP) can be solved in polynomial time. 
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Proof. We show that there is a polynomial time separation oracle for the con- 
straints (6). Using the result from [11] implies the claim. 

Suppose that {x, z) is a solution to be tested for satisfying the constraints (6) 
for a fixed w. We set up a complete graph with edge capacities Zyu (u,v € U). 
We then add a new node w and edges (w, v) of capacity Xy,y for all € U. It is 
now easy to see that there exists a cut separating r and w of capacity less than 
one if and only if constraints (6) are violated for w. □ 

Service Clusters and Group Steiner Tree Construction. Let e > 0. De- 
note by {x, z) the optimal fractional solution of (LP) and by Zlp := c{e)ze 

the optimal objective function value. For each vertex w € V define the value 

Dw '■= ^ d{v,w)xyw 
vev 

and the subset (service cluster) 

Gtu{e) := {v G V : d{v, w) < (1 -I- e)Dyj }. 

The value Dy, is the contribution of vertex w to the total service cost in the 
optimum fractional solution of the Linear Program. The set Gw(e) consists of 
all those vertices that are “sufficiently close” to w. 

Lemma 2. For each w G V we have YhveGn,(e) > e/(l + e)- 

Proof. If the claim is false for w G U then we have > 1 — e/ (H-e) = 

l/( 1 -I- e) . Thus 

Dyy ^ ^ d{y ^ w'jXyyy ^ ^ ^ d(^V ^ W^Xyyj ^ ( 1 “t“ D yj ^ ^ Xyyy ^ dD yj . 

veV v<fGy,{e) 

This is a contradiction. Hence the claim must hold. □ 

The Group Steiner Tree Problem (GST) is defined as follows: Given a com- 
plete undirected graph G = {V, E) with edge weights c(e) (e G E) and a collec- 
tion Gi, . . . ,Gfc of (not necessarily disjoint) subsets of V, find a subtree of G 
of minimum cost such that this tree contains at least one vertex from each of 
the groups Gi , . . . , G^ . Gharikar et al. [5] gave an approximation algorithm for 
GST with polylogarithmic performance guarantee. We will use this algorithm as 
a subroutine. Gonsider the instance of GST on the graph G given in the instance 
of Bccmed where the groups are the sets Gyj(e) (w G V), and the edge weights 
are the c- weights. This problem is formulated as an Integer Linear Program as 
follows: 

(GST) min c(e)ze 

e€E 

^ Zvw >1 {S C V,r ^ S, Gy,(e) C S for some w) 

vGS,w^S 

Ze € { 0 , 1 } 



(eGE) 




Budget Constrained Minimum Cost Connected Medians 
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The algorithm [5] finds a group Steiner tree of cost at most 0{log^m log log m) 
times X^eG-E = ^LP-GST, where z* denotes the optimal fractional solution 
of the LP-relaxation (LP-GST) and ^lp-GST denotes its objective function value. 



Lemma 3. Denote by Zip_asT otid Zip the optimal values of the LP-relaxa- 
tions of the Integer Linear Program (GST) and (IP), respectively. Then 
Zlp-gst < (1 + l / e )^ LP - 



Proof. We show that the vector z defined by := (1 + l/e)z„„ is feasible 
for the LP-relaxation of (GST). This implies the claim of the lemma. To this 
end let S be an arbitrary subset such that r ^ S, Gw{e) C S for some w and 
ux^S ^ Since (x,z) is feasible for (LP), it satisfies constraint (6), 
i-e-, J2v<fs + J2ves,w<fs ^vw>^- Hence we get that 



E ^ 

vGS,w^S 



> 1 - 



v^S 



> 1 - 







e 



1-f e' 



(since Gyj{e) C S) 



(by Lemma 2) 



Multiplying the above chain of inequalities by 1 -I- 1/e yields the claim. □ 

Hence we know that .^lp-GST < (^ + e) ^lp — (^ + e) OPT. We can now 
use the algorithm from [5] to obtain a group Steiner tree. By the last chain of 
inequalities this tree is within a factor (l-|-l/e)(!l(log^ mloglogm) of the optimal 
solution value for the instance of Bccmed while the budget constraint on the 
service distance is violated by a factor of at most 1 -I- e: 

Theorem 7. For any fixed e > 0 there is a {l + e, {l + l/e)0(log^ mloglogm))- 
approximation algorithm for Bccmed. □ 
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Abstract. A mixed hypergraph is a hypergraph with edges classified as 
of type 1 or type 2. A vertex coloring is strict if no edge of type 1 is to- 
tally multicolored, and no edge of type 2 monochromatic. The chromatic 
spectrum of a mixed hypergraph is the set of integers k for which there 
exists a strict coloring using exactly k different colors. A mixed hypertree 
is a mixed hypergraph in which every hyperedge induces a subtree of the 
given underlying tree. We prove that mixed hypertrees have continuous 
spectra (unlike general hypergraphs, whose spectra may contain gaps [cf. 

Jiang et al.: The chromatic spectrum of mixed hypergraphs, submitted]. 

We prove that determining the upper chromatic number (the maximum 
of the spectrum) of mixed hypertrees is NP-hard, and we identify several 
polynomially solvable classes of instances of the problem. 

1 Introduction 

Definition 1. A mixed hypergraph is a triple H = {V,C,T>), where V is a set of 
vertices andC,!) are sets of hyperedges (hyperedges are subsets of the vertex set). 
A vertex coloring of a mixed hypergraph is strict if every edge e € C contains 
two vertices of the same color, and every edge e € T> contains two vertices of 
different colors. A strict coloring that uses exactly k distinct colors is called a 
strict fc-coloring. The chromatic spectrum of H is the set Sp{H) of integers k 
such that H has a strict k-coloring. The spectrum of H is called continuous if it 
is an interval (in the set of integers). We denote x{H) = maxS'p(iJ) the upper 
chromatic number of H . 
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The notion of mixed hypergraphs was introduced by Voloshin [10]. The con- 
cept is steadily gaining more interest both for potential applications and inte- 
resting theoretical results. Applications to list colorings, integer programming, 
scheduling and molecular biology can be found in [11,9]. Mixed colorings of 
block designs of various types were considered in [3, 7, 6, 8]. Other coloring pro- 
blems studied from a different point of view can be rephrased in terms of mixed 
hypergraphs [5]. 

Solving a long-standing open problem, Jiang et al. [4] showed that there are 
mixed hypergraphs whose spectra have gaps, i.e., are not continuous. Moreover, 
for every finite set of integers greater than 1 there exists a mixed hypergraph 
whose chromatic spectrum coincides with the given set. They also showed that 
spectra of interval hypergraphs are continuous. (A hypergraph is interval if its 
vertex set allows a linear ordering such that every edge is an interval in this 
ordering.) In this paper, we extend the former result to the class of hypertrees. 

Definition 2. A mixed hypertree is a mixed hypergraph H = (V,C,T>) such 
that there exists a tree T with vertex set V (T) = V and such that every edge of 
H induces a subtree in T. 

Without loss of generality, we assume that every edge of our hypertree has 
size at least 2, and every C-edge has size at least three (the endpoints of a C-edge 
of size two must be colored by the same color and we can contract this edge). 
We also assume that the hypertree has at least one edge. Every 2-coloring of the 
underlying tree (i.e., a bipartition of T) is then a strict coloring of H, as every 27- 
edge contains two vertices adjacent in T (and hence colored by different colors), 
and every C-edge contains at least 3 vertices (and hence at least 2 vertices of the 
same color by the pigeon-hole principle). Thus we have the following: 

Observation 1. For every hypertree H, 2 belongs to the chromatic spectrum of 
22 and is its minimum member, unless H has no V-edges. 

Our aim is to prove the following theorem: 

Theorem 1. The chromatic spectrum of any mixed hypertree is continuous, in 
particular Sp{H) = [2,x(22)] if FI contains a V-edge, and Sp{H) = [l,x(22)] 
otherwise. 

This theorem is proved in the next section. Later, we address the algorithmic 
questions. In general, the questions of deciding if a mixed hypergraph is strict 
2-colorable, or whether its upper chromatic number has a given value may be 
of different complexity; e.g., if 22 has no C edges then x(22) = n while it is NP- 
complete to decide whether 2 G Sp{H) (the problem of bicolorability of ordinary 
hypergraphs). Due to Theorem 1, these two problems are equally difficult for 
mixed hypertrees: 

Proposition 1. The problem of determining the upper chromatic number of a 
mixed hypertree and the problem of deciding k € Sp{H) for an input integer k, 
are polynomially equivalent. 
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Proof. Given k > 2, this k belongs to Sp{H) if and only if fc < x{H). On the 
other hand, the upper chromatic number can be found by deciding for every 
k < n = \V\ whether H is strict fc-colorable. □ 

2 Chromatic Spectra of Mixed Hypertrees 

First, we show that a minimal counterexample to Theorem 1 (if it existed) would 
have T> = E{T), i.e. the edges of the underlying tree, E{T), would be precisely 
the 2?-edges of El. 

Lemma 1. Let H he a hypertree with diseonnected spectrum and with minimum 
possible number of vertices. Then every tree edge xy is a subset of some V-edge 
ofH. 

Proof. Suppose H has 2 vertices, say x,y, adjacent in T which do not belong 
together to the same P-edge of H. Let Ti be the largest subtree of T which 
contains x but not y, and let T 2 be the largest subtree containing y but not x. 
Let Hi, i = 1,2, be the hypertree induced by Ti, i.e.. Hi contains exactly those 
edges of H that consist only from vertices of Ti. Both Hi and H 2 are smaller 
than H and hence have continuous spectra. 

Denote c = x(iL) and consider a c-coloring / of H. Then / restricted to Hi 
is a strict coloring and uses some Ci < c colors, and similarly, / restricted to H 2 
uses some C2 colors. The union of these two color sets is the set of c colors used 
on H , and hence 

Cl + C2 > c. 

For i = 1,2 and for every ti, min Sp{Hi) <ti<Ci, consider a f^-coloring fi of Hi. 
If necessary, rename and permute the colors so that fi{x) = f 2 {y) and so that 
this is the only color that fi and /2 have in common. Regard the union of fi 
and /2 as coloring g of the entire H. This coloring uses ti +t 2 ~ 1 colors, and it 
is a strict coloring of H: every T>-edge of H lies entirely within Hi or within H 2 , 
and hence cannot be monochromatic; no C-edge that lies totally within Hi or 
within H 2 is totally multicolored since it was presented and hence well colored 
in Hi or H 2 , and all the remaining C-edges are well colored because they contain 
both X and y (and g{x) = g{y)). 

If we let ti and t 2 range over Sp{Hi) and Sp{H 2 ), respectively, we obtain 
strict t-colorings of H at least in the range 3<t<Ci+C2 — 1. Since Ci + C 2 — 1 > 
c — 1, this covers the range [3, c — 1]. But H has a strict 2-coloring (Lemma 1), 
and a strict /c-coloring by the definition of c = x(H). Hence H has a continuous 
spectrum, contradicting the assumption. □ 

We define the size of H as the sum of the sizes of its edges, i.e., s{H) = 
X^eGCuC l®l- 

Lemma 2. Let H he a hypertree with disconnected spectrum, and of minimum 
possible size. Then all V-edges have size 2. 
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Proof. Suppose H has a P-edge e of size greater than 2. Denote c = x{H), 
and fix a strict c-coloring f oi H . The edge e contains 2 vertices, say x and y, 
such that f{x) yf f{y) and such that x and y are adjacent in the underlying 
tree T (otherwise e would be monochromatic in /). Consider H' = {V,C,{T>\ 
e) U {{x,y}}), i.e., replace e by the pair x,y. The resulting hypergraph H' is a 
hypertree again, because xy is an edge of T. Since / is a strict coloring of H' 
as well, c G Sp(H'). And since s{H') < s{H), and H was a minimum hypertree 
with disconnected spectrum, the spectrum of H' is continuous, i.e., for every 
t,2 < t < c, H' has a strict t-coloring, which is also a strict t-coloring of H. This 
contradicts the assumption of H having a disconnected spectrum. □ 



Corollary 1. Let H he hypertree with disconnected spectrum, with minimum 
possible number of vertices, and with minimum possible size. Then T> = E(T). 

We therefore restrict our attention to mixed hypertrees with T> = E(T), 
and hence use the notation E[ = {T,C,T> = E{T)). Our goal is to show that 
all such hypertrees have continuous spetra. That would prove Theorem 1, since 
if there were a counterexample, then a minimum counterexample would satisfy 
T> — E(T). In order to prove this, we introduce a seemingly more general problem 
of finding strict colorings of precolored hypertrees. The input of this problem 
consists of a mixed hypertree H = {T,C, E{T)) and k disjoint independent sets 
Si, S 2 , . . . , Sk of vertices, and the task is to find a strict coloring which colors 
all vertices of Si with color bi, for each i = 1,2, ... ,k. Such a coloring will be 
called a precoloring extension. (Here and later on, we refer to a set of vertices as 
an independent set if it is independent in the underlying tree T.) 

Lemma 3. Let H he a hypertree and let S\, S 2 , . ■ . , Sk be disjoint nonempty 
independent sets of vertices of H precolored by 5i, 62 , . . . , respectively . Lf H 
has a precoloring extension using c > k + 2 colors, then it has a precoloring 
extension which uses exactly k + 2 colors. 

Proof. Let the precoloring extension of H and Si, . . . , Sk which uses c colors be 
/. For every C-edge e, fix two different vertices Xe, 2 /e G e such that f{xe) = f{ye) 
(every C-edge contains at least one pair of such vertices). Replace every C-edge 
e by the C-edge e' consisting of all vertices of the Xe, 2 /e-path in T. For the 
resulting hypertree H' , f is still a strict precoloring extension, and vice versa, 
every precoloring extension for P[' (and Si,. .. , Sk) is also a strict precoloring 
extension for H (and Si,. . . , Sk). Note that in H' , every C-edge induces a path 
in the underlying tree, and therefore it makes sense talking about end-vertices 
of C-edges. 

Now for H' , enlarge the sets Si (if necessary) according to the following 
procedure: If there is a C-edge e' which starts in a vertex Xe G Si for some i, and 
ends in a vertex ye ^ Si, then add ye into Si. Repeat this step until no such edge 
exists. Note that the new Sfs are still disjoint after this procedure is finished, 
since we have only added vertices of color bi into the set Si. 
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For the sake of brevity write 



k 

s=\Js,. 

i=l 

Next, we define the set A of auxiliary two-element edges, which contains the 
pairs Xeye, e € C such that e \ {xe, ye} C S. 

Define the auxiliary graph G as the graph obtained from T by deleting S 
and then contracting all edges of A by collapsing their end-vertices. We claim 
that G has at least two vertices and is acyclic. Indeed, since the coloring / uses 
at least two colors other than b\ . . .bk, these two additional colors must remain 
in G, since vertices of different colors cannot be collapsed. Also, suppose G has 
a cycle. This cycle corresponds to a cycle in T -|- A which contains at least three 
edges of T. Replacing every edge in A by the corresponding (xe?/e)-path in T, 
we get a closed walk in T which traverses these (at least) three edges each only 
once, a contradiction with acyclicity of T. 

Since G is acyclic, it can be colored by two colors, say colors bk+i and bk+ 2 - 
And since G has at least two vertices, we can take such coloring which actually 
uses both colors. We claim that this coloring yields a strict precoloring extension 
of H' and Si, ... ,Sk, and it obviously uses exactly k + 2 colors. 

This coloring is strict on all P-edges (i.e., edges of the tree T) by its definition. 
We will show that every C-edge e' contains two vertices of the same color: 

Case 1: e' has at least one end- vertex in S\ Then e' has both end- vertices in S 
by the enlargment procedure, and these endpoints are in the same Si, i.e., they 
are colored by the same color. 

Case 2: e' has both endpoints in T\S' and all other vertices in S\ Then Xeye G A, 
Xe and ye are unified into one vertex of G, and so Xe and ye get the same color. 
Case 3: e' has at least 3 vertices in T\S: Then these 3 vertices are colored by 
colors bk+i Etnd bk +2 and two of them must get the same color. □ 



Lemma 4. Let H be a mixed hypertree and let Si, S 2 , . . ■ , Sk be disjoint non- 
empty independent sets of vertices of H. If H has a precoloring extension using 
c > k 2 colors, then it has precoloring extensions using exactly t colors for 
every t,k 2 <t < c. 

Proof. We will prove the statement by induction on the number of vertices in 
T\S (again S = U^=i Note also that the statement holds trivially true due 
to Lemma 3 if c = A: -I- 2, and so we assume that c > k 2 further on. 

Let the precoloring extension of H and Si,...,Sk which uses c colors be 
/. Replace every C-edge e by e' as in the preceding proof (e' is a Xe,j/e-path 
in T and f(xe) = /(2/e))- Enlarge the sets Si as in the preceding proof (if 
Xe G Si and ye ^ S then add ye into Si and repeat). Denote S'' the enlarged 
set Si, i = l,...,k, and S' = Ui=i 'S'i- Let H' be the hypertree obtained by 
replacing each e by e'. Again, / is a strict precoloring extension of H' (and S') 
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using exactly k colors, and every precoloring extension of H' (and S') is also a 
precoloring extension of H (and S). 

H contains at least one vertex not belonging to S' (e.g., the vertex colored 
by a color different from bi, . . . ,bk)- Choose a vertex not belonging to S' , call it 
X, such that all other vertices in V \ S' lie in the same connected component of 
T— {x}. Let T 2 be this connected component and let Ti = T\T 2 he the subtree 
rooted in x that is disjoint with T 2 - Finally, let y be the neighbor of x in T 2 . 

Now let C be the set of endpoints of C-edges e' that have x as the other 
endpoint, formally 

C = {ye'- Xe = x,e & C}\J {xe ■■ Ve = x,e & C}. 

We distinguish three cases: 

Case 1: f{x) = bi for some i. Then redefine S'' := S'' U C U {x}. The hypertree 
H' has a strict precoloring extension for this new S( , . . . , S(, (namely /) and 
has less unprecolored vertices than H (with precoloring Si, . . . ,Sfc). Hence by 
induction hypothesis, H' has precoloring extensions using exactly t colors for 
every fc + 2 < t < c. Each of these extensions is also a strict precoloring extension 
for H and Si, ... , S^. 

Case 2: f{x) yf bi for alH = 1, 2, . . . , fc and (7 = 0. Consider H 2 as the hypertree 
on T 2 with all edges e' that lie entirely within T 2 as the C-edges of H 2 - All the 
C-edges e' not entirely lying in T 2 will be colored properly, since their both end 
points are in S' for the same i and thus the following construction assign their 
ends the same color (bi). Set S" = S'nT 2 and say that k' of these are nonempty. 
The hypertree H 2 has less unprecolored vertices than H (at least x is missing) 
and we may use induction hypothesis. The coloring / restricted to H 2 is a strict 
precoloring extension and it uses at least k' + c— k—1 colors (all the new colors 
used by / are used in T 2 except possibly for the color of x) . But since c — k > 3, 
k' + c—k — l > A:' -I- 2 as we need for Lemma 3. It follows that H 2 has a precoloring 
extension which uses exactly t colors for every t, k' + 2 < t < k' + c— k—1. Each 
such coloring g' can be re-extended to a coloring g of iL by giving x the color 
g{x) = bk +2 if g'{y) = bk+i and setting g(x) = bk+i otherwise. Such a coloring g 
then uses t + k — k' colors, and this number ranges from A: -|- 2 (for A = A:' -I- 2) to 
c — 1 (for t = k' + c — k — 1). This is what we needed, as a precoloring extension 
using c colors exists by the assumption itself. 

Case 3: f{x) yf bi for all i = 1,2,..., A: and (7 yf 0. Then (7 C T 2 and 
(7 n S" = 0. Consider H 2 as the hypertree on T 2 with all edges e' that lie entirely 
within T 2 as the (3-edges of H 2 ] the other (3-edges will be colored properly for the 
same reason as in the previous case. Set S'" = S'nT 2 , S^_|_;^ = CU{x} and let k' of 
the sets S'/,i < k he nonempty. The hypertree H 2 has less unprecolored vertices 
than H (at least x is missing), but now A;'-|-l < A: -1-1 is the number of colors used 
for the precoloring. By induction hypothesis, since H 2 has a strict precoloring 
extension (namely / restricted to H 2 ) which uses exactly c — k + k' > k' + 3 
colors, H 2 has a precoloring extension which uses exactly t colors for every t, 
k' + 3 < t < k' + c — k. Each such coloring g' can be re-extended to a coloring g 
of H by giving x the color g{x) = bk +2 if g'iu) = h+i and setting g{x) = bk+i 
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otherwise (note that in the former case all C edges starting in x and ending in 
C are well colored because they contain vertex y G T 2 of the same color bk+i as 
the endpoint in C). Such a coloring g then uses t + k — k' colors, and this number 
ranges from fc + 3 (for t = fc' + 3) to c (for t = k' + c — k). This is what we 
needed, since a precoloring extension using fc + 2 colors exists by Lemma 3. □ 

For the proof of Theorem 1, use Lemma 4 with k = 0 and c = x(H). 



3 Dynamic Programming Algorithm 

One might expect that deciding strict fc-colorability of mixed hypertrees should 
be solvable in polynomial time due to the structure of the underlying tree. We 
will see in the next section that this is not true (unless P=NP). In this section 
we study the more or less straightforward dynamic programming algorithm to 
decide the existence of a strict coloring of a mixed hypertree H . Henceforth, n 
will denote the number of vertices of H , k the number of colors, s the maximum 
size of a hyperedge and Ig (/„) the maximum iL-load of an edge (a vertex) of 
the underlying tree T, where the load of an edge (a vertex) is the number of 
hyperedges that include both of its end points (the vertex itself). 

Consider a given mixed hypertree H = (T,C,T>) on an underlying tree T 
rooted in a leaf r € V{T). Let e = {u,v} be an edge of the underlying tree T 
with V being closer to r than u (in T). The hypergraph induced by the vertices 
of T in the same connected component of T — {u} as u will be denoted by 
He- (This subhypertree contains exactly those hyperedges of H that have all 
vertices in Hg.) Let ei, 62 , . . . , be the hyperedges containing v and not fully 
contained in He- We maintain boolean arrays ^®(oi, 02 , &i, 62 , . . . , 6 m, c) indexed 
by ai, 02 G {1,2,..., fc}, 61 , . . . , 6 m G (0, 1, ■ • ■ , and c G (1, 2, . . . , A:} such that 

^®(ai,a 2 , 6 i, 62 ,..., 6 m,c) = true 

if and only if iJg allows a strict coloring (p : — ( 1 , 2 , . . . , fc} such that 

1 . (j>{v) = ai,(j>{u) = 02 , 

2 . for every j = 1 , 2 , . . . , m, if bj > 0 then ej fl V {Hg) contains a vertex of color 
bj, and if bj = 0 then Cj is satisfied within V{He), i.e., ej fl V{He) contains 
two vertices of the same color if Cj G C and Cj fl V {He) contains two vertices 
of different colors if ej G V, 

3. (j) uses exactly c colors on V{He). 

The space needed to maintain these arrays is 0{nk^'^^'^). It is also clear that 
H allows a strict coloring using exactly k colors if and only if 



<^'’'“( 01 , 02 , k) = true 

for some oi and 02 where Ve is the tree-edge containing the root r (note that 
m = 0 in the case of the root since all hyperedges are fully contained in = H). 
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The initialization of the arrays for the leaves of T is obvious. We will only 
add hints how to update the arrays. Straightforward bottom-up strategy is used: 
consider for vertex x with children yi,y 2 , ■ ■ ■ ,yg, a true entry of the arrays 
for each i and derive the information for where z is the parent of x. This 
would be, however, exponential in g, the number of children of x, i.e., exponential 
in d. Our aim is to be a little more careful here. Let ei, . . . , be the hyperedges 
that contain x. We introduce another array ifj, 1 < j < g, such that 

!f'|’(ai, 02 , 6 i,...,&m,c) =true 

if and only if the hypertree L/J induced by {z}UlJ^^;^ allows a strict coloring 
Ip : — >■ { 1 , 2 , ... , k} such that 

1 . ip{z) = ai,(l){x) = 02 , 

2. for every = 1, 2, . . . , m, if > 0 then eh O V{Hj) contains a vertex of 
color bh, and if bh = 0 then Ch is satisfied within 

3. Ip uses exactly c colors on V{Hj). 

The “horizontal” recursion step J — >■ J + 1 consists of combining true entries of 
S'® with a true entry of which takes 0{kP^^^).0{kP'^^'^).0{2^'’).0{k) = 

time, since there are at most true entries ^ com- 
bined with at most true entries ifj (we only consider those that use 

the same color for x), and for this choice of two entries we check all hyperedges 
containing x. If a hyperedge is fully contained in Hj we check if it is satisfied 
in or in or by their combination. For hyperedges that are not fully 

contained in Hj, we choose which color will be propagated (henceforth the fac- 
tor 0(2*”)). Finally, we compare the colors appearing in and in Say 

<p^vj+i explicitly names ki colors, names /c 2 colors, and k$ of these are in 
common. In total, <P^yi+^ uses Ci colors and S'® uses C 2 colors. The anonymous 
Cl — ki colors may be those from k 2 — k^ explicit colors for d'J, but some or all 
of them may be completely new. This consideration gives an interval of possible 
numbers of colors used in and the range of the interval can be determined 

in time linear in k. For all these values of parameters we set to true. 

Skipping the tedious details, we have sketched the argument for the following 
proposition: 

Theorem 2. It can be decided if H allows a k-strict coloring in time 0{n ■ 
2*” fc®+*”+*”)). 

A straightforward corollary reads that the upper chromatic number of a 
mixed hypertree can be determined in polynomial time if the maximum vertex 
load Iv is bounded. 

4 Upper Chromatic Number is NP-Hard 

Theorem 3. It is NP-complete to decide ifx{H) > |(n-|-2) even for hypertrees 
H = (V,C, E(T)) with maximum edge load 4 and hyperedge size at most 1, where 
n is the number of the vertices of H . 
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Proof. Given a CNF formula <P with the set C of clauses over a set X of variables, 
define a tree T with 3m + 1 vertices (here m = |X |) by 

V{T) = {r)\J X\J {x+ : X € X}\J {x~ : X € X} 

and edges 

E{T) = {x~^ x,x~ x,xr : x € X} 
and define a mixed hypergraph H on the same vertex set by 

V = E{T) 

C = {{x, x~^ , r} : X G X} U {Cc '■ c G C} 

where the set Cc is the smallest connected subtree of T containing the literals 
of the clause c (x“'" is the literal for positive occurrence of x and x~ is the literal 
for negated occurrence). 

We claim that H allows a (2m+l)-strict coloring if and only if is satisfiable. 
Suppose first that / : V (T) — >• {1,2,..., 2m + 1} is a strict coloring. For every 
variable x, the C edge |x,x''‘,x“,r} implies that at least one of the following 
three cases occurs: 

= f{r) 
f{x~) = /(r) 

= f{x~) 

and to have the total number of colors 2m + 1, exactly one of these three equa- 
lities holds true and no other equalities among the colors hold whatsoever, in 
particular, for every variable x 

= f{x~) ^ f{r) yf /(x"). 

Define a truth valuation (p by 

MU) = / true if /(r) = /(x+) 

' { false if /(r) = /(x ) 

(some variables may not be defined any value but none is set both true and false). 
The only possibility for an edge Cc to contain two vertices of the same color is 
that it contains a literal of the same color as r, i.e., every clause c contains a 
positive literal x such that /(x+) = /(r) (and hence </>(x) = true) or it contains 
a negated literal ~^y such that f{y~) = f{r) (and hence (p{y) = false) and -•y 
satisfies c. 

If <P is satisfiable then a coloring using 2m -I- 1 colors can be constructed along 
the lines above. 

Finally for the claim about edge loads, start with a formula such that every 
clause contains two or three literals and every veriable occurs in at most three 
clauses. Such restricted SAT problem is well known to be NP-complete. □ 
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The following theorem states that the hardness of the decision problem of 
the upper chromatic number for mixed hypertrees is caused by C-edges. 

Theorem 4. It is NP-complete to decide ifx{H) > k even for hypertrees H = 

(y,c,0). 

Proof. Given a graph G containing n vertices, define a tree T with n + 1 vertices 
(let w be a new vertex not contained in V{G)) by 

V{T) = V{G) U {w} 



and edges 



E{T) = {vw:v£ V{G)} 



and define a mixed hypertree H on the same vertex set by 



C = {{urcu} : uv G E{G)} 

= 0 

We claim that x(i?) = a{G) + 1 and thus this construction yields a reduction 
from the well known NP-complete problem INDEPENDENT SET. 

Let / be a coloring of H using fc -|- 1 colors. Let Vi, ... ,Vk be vertices of H 
such that f{vi) yf f{vj) for all i ^ j and f{vi) yf f{w) for all 1 < z < fc. If there 
were an edge ViVj, then the edge {viWVj} G C would not be colored properly. 
Thus the vertices vi, . . . ,Vk create an independent set of G and a{G) > k. 

On the other hand, let ui, . . . , be an independent set of G and bi, . . . , bk+i 
be A: -I- 1 different colors. Define a coloring / of iL by /(uj) = bi, 1 < i < k, and 
f{v) = bk+i for all u G P (H) \ {v \, . . . , Vk\. This coloring is strict, since for every 
edge uwv G C either zz or w is different from all Wj, 1 < z < fc (otherwise v\, ... ,Vk 
would not be an independent set), and thus its color is the same as the color of 
w. This implies that x{H) > fc -I- 1. We conclude that a{G) -I- 1 = x{H). □ 



5 Conclusion 

Several questions concerning the computational complexity remain open. E.g., 
we have shown that strict colorability is polynomial if the loads of vertices are 
bounded, while the problem is NP-complete for unbounded vertex loads. The 
complexity of the case when the number of colors is fixed (but the load of vertices 
unbounded) is presently open. 

Another most probably hard problem is to decide if the chromatic spectrum 
of a given mixed hypergraph is continuous. 
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Abstract. Given a graph G, a designated vertex r and a natural number 
k, we wish to find k “independent” spanning trees of G rooted at r, that 
is, k spanning trees such that, for any vertex v, the k paths connecting 
r and v in the k trees are internally disjoint in G. In this paper we 
give a linear-time algorithm to find k independent spanning trees in a 
fc-connected maximal planar graph rooted at any designated vertex. 



1 Introduction 

Given a graph G = (V, E), a, designated vertex r G V and a natural number k, we 
wish to find k spanning trees T\,T 2 , ■ ■ ■ ,Tk of G such that, for any vertex v, the 
k paths connecting r and v in T\,T 2 , ■ ■ ■ ,Tk are internally disjoint in G, that is, 
any two of them have no common intermediate vertices. Such k trees are called 
k independent spanning trees of G rooted at r. Five independent spanning trees 
are drawn in Fig. 1 by thick lines. Independent spanning trees have applications 
to fault-tolerant protocols in networks [1,7,11,15,17]. 

Given a graph G = (V, E) of n vertices and m edges, and a designated vertex 
r G V , one can find two independent spanning trees of G rooted at any vertex 
in linear time if G is biconnected [2,3,11], and find three independent spanning 
trees of G rooted at any vertex in 0{mn) and O(n^) time if G is triconnected 
[2,3,6]. It is conjectured that, for any fc > 1, every fc-connected graph has fc 
independent spanning trees rooted at any vertex [14,20]. For general graphs 
with fc > 4 the conjecture is still open, however, for planar graphs the conjecture 
is verified by Huck for fc = 4 [9] and fc = 5 [10] (i.e., for all planar graphs, since 
every planar graph has a vertex of degree at most 5 [19, p269] means there is 
no 6-connected planar graph). The proof in [10] yields an algorithm to actually 
find fc independent spanning trees in a fc-connected planar graph, but it takes 
time O(n^). On the other hand, for fc-connected maximal planar graphs we can 
find fc independent spanning trees in linear time for fc = 2 [2,3,11], fc = 3 [2,3, 
18] and fc = 4 [15]. 

In this paper we give a simple linear-time algorithm to find five independent 
spanning trees of a 5-connected maximal planar graph rooted at any designated 
vertex. Note that, since there is no 6-connected planar graph, our result, to- 
gether with previous results [2,3,11,15,18], yields a linear-time algorithm to find 
fc independent spanning trees in a fc-connected maximal planar graph rooted 
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Fig. 1. Five independent spanning trees Ti, T 2 , Ta, T 4 and T 5 of a graph G rooted at r. 



at any designated vertex. Our algorithm is based on a “5-canonical decompo- 
sition” of a 5-connected maximal planar graph, which is a generalization of an 
st-numbering [8], a canonical ordering [12], a canonical decomposition [4,5], a 
canonical 4-ordering [13] and a 4-canonical decomposition [15,16]. 

The remainder of the paper is organized as follows. In Section 2 we introduce 
some definitions. In Section 3 we present our algorithm to find five independent 
spanning trees based on a 5-canonical decomposition. In Section 4 we give an 
algorithm to find a 5-canonical decomposition. Finally we put conclusion in 
Section 5. 



2 Preliminaries 

In this section we introduce some definitions. 

Let G = (V,E) be a connected graph with vertex set V and edge set E. 
Throughout the paper we denote by n the number of vertices in G, and we 
always assume that n > 5. An edge joining vertices u and v is denoted by 
(u,v). The degree of a vertex v in G, denoted by d{v,G) or simply by d{v), is 
the number of neighbors of v in G. The connectivity k{G) of a graph G is the 
minimum number of vertices whose removal results in a disconnected graph or 
a single- vertex graph Ki. A graph G is k-connected if k(G) > k. A path in a 
graph is an ordered list of distinct vertices {vi,V 2 , • • • , uj) such that (ui_i, is 
an edge for all z, 2 < z < 1. We say that two paths are internally disjoint if their 
intermediate vertices are disjoint. We also say that a set of paths are internally 
disjoint if every pair of paths in the set are internally disjoint. 




292 S. Nagai and S.-i. Nakano 



A graph is planar if it can be embedded in the plane so that no two edges 
intersect geometrically except at a vertex to which they are both incident. A 
planar graph G is maximal if all faces including the outer face are triangles 
in some planar embedding of G. Essentially each maximal planar graph has 
a unique planar embedding except for the choice of the outer face. A plane 
graph is a planar graph with a fixed planar embedding. The contour Co{G) of a 
biconnected plane graph G is the clockwise (simple) cycle on the outer face. We 
write C'o(G) = {w\,W2, • • • , Wh) if the vertices W\,W2, ■ ■ ■ ,Wh on Go{G) appear 
in this order. 

Let Pi = (ui, V2, • • • , vi) and P2 = {wi,W2, • • • , wp ) be two paths in a plane 
graph sharing a (maximal) subpath Pc = (va, Va+i, ■ ■ ■ ,Vb), where Pc may be a 
vertex, Va-i be the preceding vertex of Pc in Pi, and Vb+i be the succeeding 
vertex of Pc in Pi. If we cannot add an edge e = {wi,wp) to P2 so that ( 1 ) 
{va-i,Vb+i} is located inside of the simple cycle consisting of P2 and e, and ( 2 ) 
e does not intersect to Pi, then we say Pi and P2 cross each other on Pc- 

3 Algorithm 

In this section we give our algorithm to find five independent spanning trees of 
a 5 -connected maximal planar graph rooted at any designated vertex. 



r rt 




Fig. 2. (a) Five-connected plane graph G and (b) plane graph G 



n 



Given a 5 -connected maximal planar graph G = (V,E) and a designated 
vertex r G V, we first find a planar embedding of G in which r is located on 
Go{G). Let G' = G — {r} be the plane subgraph of G induced by E — {r}. 
In Fig. 2 (a) G is drawn by solid and dotted lines, and G by solid lines. We 
may assume that all the neighbors ri,r2, • • • ,rd(r) of r in G appear on Go(G ) 
clockwise in this order. Now Go(G ) = (ri, r2, • • • , rd{r))- Since G is 5 -connected, 
d{r) > 5 . We add to G two new vertices Vb and rt, join rb with ri,r2 and r^, and 
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join rt with r^, r^, - ■ ■ , Let G be the resulting plane graph where vertices 
f’i,rb,r 3 ,r 4 ,rt and appear on Co{G ) clockwise in this order. Fig. 2 (b) 
illustrates G . 

Let n = (FFi, IF 2 , • • • , Wm) be a partition of the vertex set V — {r} of G . We 
denote by Gk, I < k < m, the plane subgraph of G induced by {rt} IJ^i U 
U ■ ■ ■ U ^k- We denote byGfc,0<A:<m— 1, the plane subgraph of G in- 
duced by Wk+i U Wk +2 U • • • U Wm U{^t}- A partition 77 = (IFi, W 2 , • • • , Wm) 
of V — {r} is called a 5-canonical decomposition of G if the following three 
conditions (col)-(co3) are satisfied. 

(col) VFi = {ri,r2,r3}|J{M2,W3,---,Ud(r2)-2}, where vertices U 2 ,U 3 ,---, 
Ud(r2)-2 are the neighbors of ra except ri,r3,rb, and Wm = {?"d(r)-i, ?’<i(r)} 
(co2) For each k, 1 < k < m, Gk is triconnected, and for each k, 0 < k < m—1, 
Gk is biconnected; and 

(co3) For each k, 1 < k < m, one of the following two conditions holds (See 
Fig. 3. The vertices in Wk are drawn in black dots.): 

(a) \Wk\ > 2, all vertices in Wk consecutively appear on Go{Gk) and each 
vertex u G Wk satisfies d{u, Gk) = 3 and d{u, Gk-i) > 3; and 
(b) \Wk\ = 1, and the vertex u G Wk satisfies d{u, Gk) > 3 and d{u, Gk-i) > 2. 




Fig. 3. Two conditions for (co3). 



Fig. 2 (b) illustrates a 5-canonical decomposition of G = G — {r}, where 
G are drawn in solid lines and each set Wi is indicated by an oval drawn in a 
dotted line. A 5-canonical decomposition is a generalization of an “st-numbering” 
[8], a “canonical ordering” [12], a “canonical decomposition” [4,5], a “canonical 
4-ordering” [13] and a “4-canonical decomposition” [15,16]. 

We have the following lemma. We will give a proof of Lemma 1 in Section 4. 
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Lemma 1 . Let G = (V, i?) be a 5-connected maximal plane graph, and let r be a 
designated vertex on C'o(G). Then G = G — {r} has a 5-canonical decomposition 
n. Furthermore U can be found in linear time. 

We need a few more definitions to describe our algorithm. For a vertex v G 

V — {r} we write N{v) = {vi,V2, ■ ■ ■ , Wd(j;)} if vi,V2,' ' ' : are the neighbors 

of vertex u in G and appear around v clockwise in this order. To each vertex 

V € V — {r} we assign five edges incident to in G as the right leg rl(v), the 
tail t{v), the left leg ll{v), the left hand lh{v) and the right hand rh{v) as follows. 
We will show later that such an assignment immediately yields five independent 
spanning trees of G. Let v G for some k, 1 < k < m, then there are the 
following four cases to consider. 

Case 1 : fc = 1. (See Fig. 4(a).) 

Now Wi = {ri, r2, rs} [j{u2, us, • • • , Md(r2)-2}- We may assume that vertices 
U2, U3, ■ ■ ■ , Ud(r2)-2 consecutively appear on Go(Gi) clockwise in this order. 
Let ui = r3,uo = rb,Ud(r2)-i = ri and Ud{r2) = n- For each m €Wi - {r2} 
we define rl{u^) = (ui,Ui+i), t{ui) = (u^,r2), U{ui) = {ui,Ui_i), lh{ui) = 
(ui,vi), and rh{ui) = {ui,Vd{ui)-3) where we assume N{ui) = {ui-i, vi,V2, 
■■■, Vd{ui)-3, Ui+i,r2}. For r2 we define rl{r2) = (r2,ri), t{r2) = (r2,rt), 
U{r2) = (r2,rs), lh{r2) = (r2,U2), and rh{r2) = (r2, Ud(r2)-2)- 
Case 2: Wk satisfies Condition (a) of (co3). (See Fig. 4(b).) 

Let Wk = {ui,U2, • • • , ui\ and Ui,U2, ■ ■ ■ ,ui consecutively appear on Co{Gk) 
clockwise in this order. Since d{ui,Gk) = 3 for each vertex Ui and G is 
maximal planar, vertices ui,U2, - ■ ■ ,ui have exactly one common neighbor, 
say V, in Gfc. Let uq be the vertex on Go{Gk) preceding u\, and let m;+i be 
the vertex on Co{Gk) succeeding U[. For each m G Wk we define rl{ui) = 
(Ui,u^+i), t{ui) = (Ui,v), ll{ui) = lh{uf} = (u^,Vi), and rh{ui) = 

(ui, Vd(ui)-3) where we assume N{ui) = {u^-i,vi,V2, • • • , Vd(ui)-3, Ui+i,v}. 
Case 3: Wk satisfies Condition (b) of (co3). (See Fig. 4(c).) 

Let Wk = {m}, let u be the vertex on Co{Gk) preceding u, and let u 
be the vertex on Go{Gk) succeeding u. Let N{u) = {u , vi, f2, • • • , fd(ii)-i}, 
and let u = Vx for some x, 3 < x < d{u) — 2. Then rl{u) = {u,u ), 
t{u) = {u,Vd(u)-i), ll{u) = {u,u), lh{u) = {u,vi), and rh{u) = {u,Vx-i). 
Case 4 : k = m. (See Fig. 4(d).) 

Now Wm = {rd(r)-i,rd{r)}- Let uq = rt, ui = rd(r)-i, «2 = rd(r) and 
U3 = rt. For each m G Wk we define rl{ui) = (ui,vi), t{ui) = (m*, Ud(„,)_s), 
ll{ui) = {ui,Vd(ui)-2), lh{ui) = (■Uj,'Uj_i), and rh{ui) = {u^,u^+i) where we 
assume N{ui) = {u,+i,Vi,V2, - ■ ■ ,Vd(ui)-2,Ui-i}- 

We are now ready to give our algorithm. 

Procedure FiveTrees(G, r) 
begin 

1 Find a planar embedding of G such that r G Go(G); 

2 Find a 5-canonical decomposition U = (Wi,W2, ■ ■ ■ , Wm) of G — {r}; 
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3 For each vertex v & V — {r} find rl{v),t{v), ll{v), lh{v) and rh{v); 

4 Let Tri be a graph induced by the right legs of all vertices in V — {r}; 

5 Let Tt be a graph induced by the tails of all vertices in ^ — {r}; 

6 Let Til be a graph induced by the left legs of all vertices in y — {r}; 

7 Let Tih be a graph induced by the left hands of all vertices in y — {r}; 

8 Let Trh be a graph induced by the right hands of all vertices in y — {r}; 

9 Regard vertex rj, in trees Tri, Tt and Tu as vertex r; 

10 Regard vertex rt in trees Tih and Trh as vertex r; 

11 return Tri,Tt,Tu,Tih and Trh as five independent spanning trees of G. 
end 

We then verify the correctness of our algorithm. Assume that G = {V, E) is 
a 5-connected maximal planar graph with a designated vertex r £ V, and that 
Algorithm FiveTrees finds a 5-canonical decomposition TT = (IFi, 14^2, • • • , Wm) 
of G—{r} and outputs Tri,Tt, Tu,Tih and Trh- We first have the following lemma. 

Lemma 2. Let 1 < k < m, and let T^i be a graph induced by the right legs of 
all vertices in Gk — {rt}- Then T^i is a spanning tree of Gk- 

Proof. We can prove the claim by induction on T. □ 

We then have the following lemma. 

Lemma 3. Tri,Tt,Tu,Tih and Trh are spanning trees of G. 

Proof. By Lemma 2, Tff is a spanning tree of Gm, and hence Tri in which rj, is 
regarded as r is a spanning tree of G. 

Similarly Tt,Tu,Tih and Trh are spanning trees of G. □ 

Let V be any vertex in y — {r}, and let Pri, Pt, Pa, Pih and Prh be the paths 
connecting r and v in Tri,Tt,Tu,Tih and Trh, respectively. For any vertex u 
in y — {r} we define rank{u) = k if u £ Wk- For r, ranker) is undefined. If 
an edge (v,u) of G is either a leg or a tail of vertex v, and (v,w) of G is a 
hand of v, then rank{u) < rank{v) < rank{w), and additionally if u yf r 2 then 
rank{u) < rank{w). Note that if {v,u) is in G neither v nor u is r. Also note 
that only if v = r 2 then rank{u) = rank(w). (See Fig. 4.) Now we have the 
following lemma. 

Lemma 4. Every pair of paths Pi G {Pri,Pt,Pa} and P 2 G {Pih,Prh} are 
internally disjoint. 

Proof. We prove only that Pri and Prh are internally disjoint. Proofs for the other 
pairs are similar. If u = ri, then Pri = (ri, r) and Pri is internally disjoint to any 
path, since Pri has no intermediate vertices. If u = ra(r), then Prh = {Td{r)T) and 
similarly Prh is internally disjoint to any path. If u = r 2 , then Pri = (r 2 ,ri,r), 
so only ri is the intermediate vertex on Pri, Prh = (v, Ud(r 2 )- 2 ; ' ' •)> and, for each 
intermediate vertex Vi except Ud(r 2)-2 on Prh, rank{vi) > 2. Therefore Pri and 
Prh are internally disjoint if u is r 2 . Thus we may assume that v yf r\,r 2 ,rd(r). 
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Let Pri = {v,v\,V2, - ■ ■ ,vi,r), then vi = ri. Let Prh = {v,ui,U2, ■ ■ ■ ,Ui' ,r), then 
Uf = The definition of a right leg implies that rank{v) > rank{v\) > 

rank{v2) > ■■■ > rank{vi), and the definition of a right hand implies that 
rank{v) < rank{u\) < rank{u2) < ••• < rank{ui>). Thus rank{vi) < ■■■ < 
rank{v2) < rank{vi) < rank{v) < rank{u\) < rank{u2) < • • • < rank{uf). We 
furthermore have rank{v\) < rank{u\) since v ^ r2- Therefore Pri and Prh are 
internally disjoint. □ 

If rl{v) = (v,u) then we say (v,u) is an incoming right leg of u. Similarly, if 
t(y) = {v,u) then {v,u) is an incoming tail of u, and if ll{v) = (v,u) then (v,u) 
is an incoming left leg of u. 

We have the following lemma. 

Lemma 5. Let u G V — {r}. All incoming right legs of u appear consecutively 
around u, all incoming tails of u appear consecutively around u, and all incoming 
left legs of u appear consecutively around u. Furthermore ll(u), the incoming 
right legs, incoming tails, incoming left legs and rl(u) appear clockwise around 
u in this order. 

Proof. (See Fig. 5.) Let u G V — {r}, ll{u) = {u,u ), rl{u) = {u,u ), and 
N{u) = {vq,vi, ■ ■ ■ One may assume that u = vq and u = Vz for 

some z, 3 < z < d{u) — 2. Here it is possible that rl{vo) = (vo,u) and/or 
ll{vz) = {vz,u). 




Fig. 5. Illustration for Lemma 5. 



If w = T 2 then the claim is clearly holds. (In this case there is no incoming 
legs of u.) Thus we assume u ^ r2- 

If (ui,u) is the tail of Ui G Wk then u G Co(Gk-i) and u ^ Co(Gk). (See 
Fig. 4.) Thus if t(ui) = (ui, u) and t{uj) = (uj,u) then {ui, Uj} C Wk for some k. 
Therefore all incoming tails of u appear consecutively around u. (See Fig. 4(b).) 

If 1 < i < z—1, rl{vi) = (vi, u) and Vi G Wk, then (vi-i,u) ^ Go{Gk), and eit- 
her t{u) = rl{vi-i) = (vi-i,u) or ll{u) = {u,Vi-\) hold, {li rankfvi) = 
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rank{u) then t{u) = {u,Vi-\). Otherwise rank{vi) > renk{u), and assume 
rank{vi) = k. Now edge (vi-i,u) is on Co{Gk-i)- If rank{vi-\) < rank{u) 
then ll{u) = (u,Vi-i). If rank{vi-i) > rank{u) then rl{vi-i) = {vi-i,u). See 
Fig. 4.) Thus if u has an incoming right leg e then the edge preceding e around 
u clockwise is either an incoming right leg of u, t{u) or ll{u). Since t{u) and 
ll{u) always appear consecutively around u, therefore all incoming right legs of 
u appear consecutively around u and ll{u) precedes them. Similarly all incoming 
left legs of u appear consecutively around u and rl{u) succeeds them. Thus the 
claim holds. □ 

Lemma 5 immediately implies the following lemma. 

Lemma 6. A pair of paths P\,P 2 G {Pri,Pt,Pii} may cross at a vertex u, but 
do not share a vertex u without crossing at u. 

From the definitions of a left leg , a tail and a right leg one can immediately 
have the following lemma. 

Lemma 7. Let I < k < m, u ^ r 2 and u G Wk- Then u is on Co{Gk)- Let u 
be the succeeding vertex of u on Go{Gk)- Assume that N{u) starts with u . Let 
rl{u) = {u,v ), t{u) = {u,v ) and ll{u) = {u,v ). Then v , v , v appear in 
N{u) in this order. 

We then have the following lemma. 

Lemma 8. A pair of paths Pi,P 2 G {Pri, Pt, Pu} are internally disjoint. Also 
Pih,Prh are internally disjoint. 

Proof. We prove only that Pri and Pu are internally disjoint. Proofs for the 
other cases are similar. Suppose for a contradiction that Pri and Pu share an 
intermediate vertex. Let w be the intermediate vertex that is shared by Pri and 
Pu and appear last on the path Pri going from r to v. Now w ^ T 2 because w is 
an intermediate vertex of Pri and Pu, and r 2 has degree one in both Tri and Tu- 
Then Pri and Pu cross at w by Lemma 6. However, the claim in Lemma 7 holds 
both for k = rank(v) and u = v and for k = rank{w) and u = w, and hence Pri 
and Pu do not cross at w, a contradiction. □ 

By Lemmas 4 and 8 we have the following lemma. 

Lemma 9. Tri,Tt,Tu,Tih and Tru are five independent spanning trees of G 
rooted at r. 

Clearly the running time of Algorithm FiveTrees is 0{n). Thus we have the 
following theorem. 

Theorem 1. Five independent spanning trees of any 5-connected maximal pla- 
nar graph rooted at any designated vertex can be found in linear time. 
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4 Proof of Lemma 1 

In this section we give an algorithm to find a 5-canonical decomposition. Then 
we show it runs in linear time. First we need some definitions. 

Let G = (V,E) be a 5-connected maximal plane graph, r be a designa- 
ted vertex on C'o(G). We construct G from G as in Section 3. Let iL be a 
triconnected plane subgraph of G" such that (1) iL is a subgraph of G in- 
side some cycle G of G (including G), and (2) rb,n,r 3 G Go{H) = G. Let 
Go{H) = {n = wi,W 2 = rs,, - ■ ■ ,wi = ri). 

A set of edges (vi,u), (u2, «),•••, (vh, u) in H is called a fan with center u 
if (1) u ^ Go{H), (2) the neighbors of u on Go{H) are Ui, U2, • • • , called 
leaves, and they appear in Go{H) clockwise in this order, (but may not appear 
consecutively,) and (3) either (i) h = 2 and H does not have edge{vi,V 2 ), or 
(ii) h > 3. Assume a set of edges {vi,u), {v 2 ,u), ■ ■ ■ , (vh,u) is a fan F with 
center u. Now, ior 1 < i < h — 1, Vi = Wa and Vi+\ = Wb hold for some a, 
b such that 1 < a < b < I, and let Gi be the cycle consisting of the subpath 
{wa, Wa+ 1 , • • • , Wb) of Go{H) and two edges {wb, u), (u, Wa)- Each plane subgraph 
Fi of H inside Gi (including Gi) is called a piece of F. Fi is called an empty piece 
if a -I- 1 = 6. If Fi is an empty piece then Gi is a triangle face of FI. (Since G is 
5-connected, if a -I- 1 = 6 then Fi has no vertex in the proper inside.) Note that 
by the definition if a fan has exactly two leaves then it has exactly one piece 
and the piece is not empty. Also note that F has exactly ft. — 1 pieces. If none of 
pieces of F contains a distinct fan, then F is a minimal fan. 

A cut-set is a set of vertices whose removal results in a disconnected graph. 
Since G is 5-connected and maximal planar, every cut-set of ift consisting of three 
vertices has (1) exactly one vertex not on Go(ift) and (2) exactly two vertices 
on Go(ift). Thus each cut-set of ift consisting of three vertices corresponds to a 
center of a fan and its two leaves. 

We have the following lemmas. 

Lemma 10. If a vertex v G Go{H) is on none of fans of FI (Note that, however, 
V may be contained in a piece of a fan.), then FI — {u} is triconnected, where 
H — {u} is the plane subgraph of H obtained from H by deleting v and all edges 
incident to v. 



Lemma 11. If all pieces of a fan F = {vi,u),{v 2 ,u), - ■ ■ , {vh, u) of H is empty 
(Now d{vi) > 4, d{vh) > 4 and, for j = 2, 3, • • • , ft — 1, d{vj) = 3.) and u yf r2, 
then H — {v 2 , W3, • • • , Vh-i} is triconnected, where H — {v 2 , U3, • • • , Vh-i} is a 
plane subgraph of H obtained from H by deleting U2, V3, • • • , Vh-i and all edges 
incident to them. 

Now we give our algorithm to find a 5-canonical decomposition. 

First, by Condition (col) we can find Wm- Now G^-i is biconnected since 
Gm-i is a triangle cycle. Since G = (V, F) is 5-connected, the vertex set V — {r} 
induces a 4-connected graph G'. And Gm is obtained from G' by adding a new 
vertex rb adjacent three vertices of G'. Now Gm is triconnected since a graph 
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obtained from a fc-connected graph G by adding a new vertex adjacent k vertices 
of G is also fc-connected [19, pl45]. Also Gm-i is triconnected, since otherwise 
Gm-i has a cut-set S with two or less vertices and then is a cut-set of 

G with four or less vertices, a contradiction. Thus for k = m — 1 and m, Gk is 
triconnected, and for fc = to — 1, Gk is biconnected. Clearly ri,r 2 ,rs ^ Wm- 

Then, inductively assume that we have chosen such 

that for each fc = i, t -|- 1, • • • , to, Gk is triconnected, and for each fc = i,i + 
1, • • • , TO — 1, Gk is biconnected, ri , r 2 , ra ^ Wm U U ’ ’ ’ U ^i+i and each 
Wk, fc = i -I- 1, i -I- 2, • • • , TO, satisfies either (col) or (co3). Now we can choose 
Wi as follows. We have two cases. If Gt has exactly one vertices in the proper 
inside of Gi then it is r 2 and we have done by setting all vertices in Gi except 
rb as Wi- Otherwise we can find Wi C V — Wm U ^m-i U ' ' ' U ^i+i such that 
(1) Gi-i is triconnected, (2) Gi_i is biconnected, (3) ri,r 2 ,r 3 ^ Wi, and (4) Wi 
satisfies (co3), as follows. 

Let F = {v\,u),{v 2 ,u),- ■ ■ ,{vh,u) be a minimal fan of Gi. Note that Gi 
always has a fan (r^, r 2 ), (ra, r 2 ), • • • , (ri, r 2 ) with center T 2 implies Gi always 
has a fan. 

If every piece of F is empty then F has three or more leaves, and we can set 
Wi = {v 2 , Vs, - ■ ■ , Vh-i}. Now if fc > 4 then Wi satisfies (a) of (co3) and is 
triconnected by Lemma 11, and Gi_i is biconnected since each vertex in Wi has 
degree exactly three in Gi means each vertex in Wi has two or more neighbors 
in Gi- Similarly if fc = 3 then Wi satisfies (b) of (co3), and Gi_i is triconnected 
by Lemma 11, and Gi_i is biconnected as above. 

Otherwise, let F' be a non-empty piece of F. Now F' has four or more 
vertices on Go{Gi) since otherwise G has a cut-set with four or less vertices, a 
contradiction. Now there exists at least one vertex of F' on Go(Gi) such that (1) 
it is not a leaf of F, and (2) it has two or more neighbors in Gi. (Since otherwise 
each vertices of F' on Go(Gi) except the two leaves Wa, Wb of F has at most one 
neighbor in Gi, and for G is maximal planar each neighbor in Gi is a common 
vertices, say x, and {u, Wa, Wb, x} forms a cut-set, a contradiction.) Thus we can 
find Wi satisfying (b) of (co3). Now Gi_i is triconnected by Lemma 10, and 
Gi-i is biconnected. 

Thus we can find a 5-canonical decomposition. By maintaining a data struc- 
ture to keep fans and the number of neighbors in Gi for each vertex, the algorithm 
runs in linear time. 

5 Conclusion 

In this paper we give a linear-time algorithm to find fc independent spanning 
trees of a fc-connected maximal planar graph rooted at any designated vertex. 
It is remained as future work to find a linear-time algorithm for planar graphs, 
which are not always maximal planar. 
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Abstract. We study the problem of designing fault-tolerant routings 
with small routing tables for a fc-connected network of n processors in 
the surviving route graph model. The surviving route graph R{G,p)/F 
for a graph G, a routing p and a set of faults C is a directed graph 
consisting of nonfaulty nodes with a directed edge from a node x to a 
node y iff there are no faults on the route from x to y. The diameter of 
the surviving route graph could be one of the fault-tolerance measures 
for the graph G and the routing p and it is denoted by D{R{G, p)/F). 
We want to reduce the total number of routes defined in the routing, and 
the maximum of the number of routes defined for a node (called route 
degree) as least as possible. In this paper, we show that we can construct 
a routing A for every n-node fe-connected graph such that n > 2k^ , in 
which the route degree is 0{ky/n), the total number of routes is 0{k^n) 
and D(R{G, X)/ F) < 3 for any fault set CdT] < k). We also show that 
we can construct a routing pi for every n-node biconnected graphs, in 
which the total number of routes is 0{n) and D{R{G, pi)/{f}) < 2 for 
any fault /, and using pi a routing p 2 for every n-node biconnected 
graphs, in which the route degree is 0(y/n), the total number of routes 
is 0{ru/n) and D{R(G, p 2 )/{f}) < 2 for any fault /. 



1 Introduction 

Consider a communication network or an undirected graph G in which a limited 
number of link and/or node faults F might occur. A routing p for a graph defines 
at most one path called route for each ordered pair of nodes. A routing is called 
minimal-length if any route from x to y is assigned to a shortest path from x to 
y. We assume that it must be chosen without knowing which components might 
be faulty. 

Given a graph G, a routing p and a set of faults F, the surviving route graph 
R{G,p)/F is defined to be a directed graph consisting of all nonfaulty nodes in 
G, with a directed edge from a node x to a node y iff the route from x to y is 
intact. The diameter of the surviving route graph (denoted by D{R{G, p)/F)) 
could be one of the fault-tolerance measures for the graph G and the routing p 
[2,4]. The routing p on G is called {d, f) -tolerant if D{R{G,p)/F) < d for any 
set F with at most / faults. 
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When we consider the fault tolerance of ATM and/or optical networks, rou- 
tings must satisfy several constraints such as the number of routes defined for 
a node {route degree of the node) and the total number of routes defined in 
the routing [12]. Since the size of the routing table is dominated by the route 
degree of the node and the edge-load of the routing (the maximum number of 
routes passing through the edge over all edges) is dependent of the total number 
of routes, the route degree and the total number of routes should be as least as 
possible [3]. 

Many results have been obtained for the diameter of the surviving route 
graph [6,10,11,13]. As long as we use minimal-length routings for general gra- 
phs, we can not expect good behavior for the diameter of the surviving route 
graph, say constant diameter [4]. It is also shown that the graph connectivity 
does not help to reduce the diamter of the surviving route graph if minimal- 
length routings are considered [4]. Therefore, we must consider non- minimal- 
length routings to obtain efficient fault-tolerant ones for /c-connected graphs. 
For n-node /c-connected graphs, a (5, fc — l)-tolerant routing and a (3, fc — 1)- 
tolerant routing can be constructed if n > and n > 2fc^, respectively [9]. A 
(2, fc— l)-tolerant routing can be constructed for every n-node /e-connected graph 
such that n > 7fc^]"log2n] [15]. However, in these routings, the route degree of 
most nodes is n — 1 and it is undesirable. Stronger results have been known for 
n-node biconnected graphs[12]; We can construct a (2, l)-tolerant routing with 
0(n) routes and a (3, l)-tolerant routing with 0(n) routes and route degree 
0{^/n) [12]. It can be shown that the former routing is optimal in the sense that 
the diameter of the surviving route graph is minimum and the total number of 
routes in the routing is minimum, and the diameter of the surviving route graph 
for the latter routing is also optimal among routings with route degree 0{y/n) 
and the total number of route in the routing is minimum. 

In this paper, we show the following results which improve the previous ones 
with respect to the route degree and the total number of routes. 

1. For every n-node fc-connected graph such that n > 2/c^, we can construct a 
(3, k — l)-tolerant routing A in which the route degree is 0{k^/n) and the 
total number of routes is 0{k^n). 

2. For every n-node biconnected graph, we can construct a (2, l)-tolerant rou- 
ting Pi in which the total number of routes is 0(n) and a (2, l)-tolerant 
routing p 2 in which the route degree is 0{y/n) and the total number of 
routes is 0{n^Jn). 

We improve the (3, k — l)-tolerant routing shown in [9] to the routing A 
so that the route degree of A is reduced to 0{k^/n) with preserving the total 
number of routes. We also show that the diameter of the surviving route graph 
for A is optimal among routings with the route degree 0{ky/n) if n > 2fc^ and 
k = o(n^/®). 

The routing p\ does not improve the previous result. However, the idea to 
define p\ is different from the previous ones and it induces the routing p 2 that is 
the first (2, l)-tolerant routing with route degree 0{y/n) for biconnected graphs. 




304 



K. Wada and W. Chen 



We also show that the total number of routes in the routing p 2 is minimum 
among (2, l)-tolerant routings with route degree 0{y/n). 

2 Preliminary 

In this section, we give definitions and terminology. We refer readers to [8] for 
basic graph terminology. 

Unless otherwise stated, we deal with an undirected graph G = (V, E) that 
corresponds to a network. For a node v of G, Nc{v) = {■u|(w,m) G E} and 
degciv) = |iV( 5 (v)|. dega{v) is called degree of v and if G is apparent it is simply 
denoted by deg{v). For a node set U C V, the subgraph induced by U is the 
maximal subgraph of G with the node set U and denoted by G < C/ >. A 
graph G is k-connected if there exist k node-disjoint paths between every pair of 
distinct nodes in G. For a node v £ V and a node set U C V — {w}, v-U fan is 
a set of \U\ disjoint paths from v to all nodes of U. Usually 2-connected graphs 
are called biconnected graphs. 

The distance between nodes x and y in G is the length of the shortest path 
between x and y and is denoted by disc{x, y) . The diameter of G is the maximum 
of discix, y) over all pairs of nodes in G and is denoted by D{G). Let P{u, v) and 
P{v, w) be a path from u to v and a path from v to w, respectively. In general, 
even if both P(u,v) and P{v,w) are simple, the concatenation of P{u,v) and 
P{v,w) is not always simple. Thus we consider two kinds of concatenation: one 
is a usual concatenation (denoted by P{u, v) ■ P{v, w)) and the other is a special 
concatenation (denoted by P{u,v) © P{v,w)), which is defined as the shortest 
path from m to w in the graph P{u, v) U P(v, w) to make the concatenated path 
simple. 

Let G = (U, E) be a graph and let x and y be nodes of G. Define Pg{x, y) to 
be the set of all simple paths from the node x to the node y in G, and P{G) to be 
the set of all simple paths in G. A routing is a partial function p :V xV P{G) 
such that p{x,y) G PG{x,y){x yf y). The path specified to be p{x,y) is called 
the route from x to y. 

For a graph G = (U, A), let F C U U if be a set of nodes and edges called 
a set of faults. We call E nV{= Ey) and E n E{= Fe) the set of node faults 
and the set of edge faults, respectively. If an object such as a route or a node set 
does not contain any element of F, the object is said to be fault free. 

For a graph G = (U, E), a routing p on G and a set of faults F{= Fy U Fe), 
the surviving route graph, R{G,p)/F, is a directed graph with node set V — Ey 
and edge set E{G,p,F) = {< x,y > \p{x,y) is defined and fault free}. In 
what follows, unless confusion arises we use notations for directed graphs as the 
same ones for undirected graphs. 

In the surviving route graph i?(G, p)/F, when F = <p the graph is called the 
route graph. In the route graph, the outdegree of a node v is called the route 
degree of a node v and the maximum of the route degree of all nodes is called the 
route degree of the routing p. The number of directed edges in the route graph 
corresponds to the total number of routes in the routing p. If the number of 
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edges in the route graph is m, the routing p is called m-route-routing or simply 
m-routing. 

For a graph G = {V, E) and a routing p, if for any edge {x, y) in G such that 
p{x, y) is defined, the route p{x, y) is assigned to the edge, p is called edge-routing. 

A routing p is bidirectional if p{x,y) = p{y,x) for any node pair (x,y) in 
the domain of p. If a routing is not bidirectional, it is called unidirectional. Note 
that if the routing p is bidirectional, the surviving route graph R{G,p)/F can 
be represented as an undirected graph. 

Given a graph G and a routing property P, a routing p on G is optimal with 
respect to P if maxps t.\F\<k {D{R{G , p) / F)) is minimum over all routings on G 
satisfying P. Note that from the definition of the optimality, if D{R{G, p)/ F) is 
2 for any set of faults F such that |P| < k, the routing is obviously optimal with 
respect to any property. If the property P is known, we simply call the routing 
is optimal. 

Lemma 1. [1] Let G = (V,E) be an n-node directed graph. If the maximum 
outdegree of G is d and the diameter of G is p, then \E\ = L2{n^ / dP~^) . 

The next theorem can be derived from this lemma. 

Theorem 1. Let G be an n-node graph and let / > 0. There does not exist a 
{2, f) -tolerant o{ny/n) -routing with the route degree 0{^/n) on G. 



3 Optimal Routing for fc-Connected Graphs 

In this section, we show that for n-node /c-connected graphs with n > 2fc^, we 
can construct a (3,/c — I)-tolerant bidirectional edge-routing A such that the 
route degree is 0(fci/n) and the total number of routes in A is O(fc^n) and we 
show that the routing A is optimal with respect to the route degree of 0{k^/n) 
if n > 2k^ and k = o(n^/®). 

The routings for fc-connected graphs are based on the following two properties 
[9,11,15]. 

Lemma 2. [8] Let G = {V,E) be a k-connected graph. Let U be any node set of 
V such that \U\ = k and let v be any node in V — U. Then there is a v-U fan. 

Let u € U. The path from u to n in v-U fan is denoted by Pfan(v,u; U). If 
there is an edge between v and u in U, the path Pfan{v, u; U) in the v-U fan can 
be changed to this edge [4]. 

Lemma 3. [7] Let G = (V, E) be a k-connected graph. Let V\,V 2 , . ■ . ,Vk be di- 
stinct nodes and 01 , 02 , . . . ,Ok be positive integers such that Sb^.^ai = \V\. Then 
there is a partition Vi,V 2 ,...,Vk of V such that Vi € If, \Vi\ = at and the 
induced subgraph G < Vi > is connected for i = 1,2, ... ,k. 
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Vi 



V,{\V,\=n/k) 



Vk 




Fig. 1. The partition of nodes in G. 



First we consider fc-connected graphs with at least nodes, and later we 
extend the result for /c-connected graphs with at least nodes. 

Let G = (V,E) be an n-node fc-connected graph such that n > 4k^. From 
lemma 3, there are k disjoint node subsets Vi, V 2 , . . . , 14 such that \Vg\ = n/k 
^ and the induced graph G < Vg > is connected for g = 1,2,. . . ,k. Let Ug be 
a subset of Vg such that |C/g| = i/n and let each Ug be partitioned into yjnj2k 
sets with each 2k nodes. These sets are denoted by U[g,P\{l < I < v^/2fc). 
Furthermore, each U[g,£] is partitioned into two sets with cardinalities k. These 
sets are denoted by U[g,t, 0] and U[g,t, 1]. Let W = V — Ug^^Ug and let W be 
partitioned into ^/n/2k sets. These sets are denoted by Wi,W2, ■ ■ ■ W^/2k- The 
partition of V is shown in Fig. 1. 

A bidirectional routing A is defined as follows: 

1. For X and y G U[g,£; 0](1 < g < k,l < £ < v4i/2fc), 

A(x, y) = X{y, x) = Pfan{x, y; U[g, £; 0]). 

2. For x,y € Ug{l <g<k), 

\{x, y) = \{y, x) =a shortest path between x and y in G < Vg >. 

3. For X € U[gi,£; i] and y G U[g2, £, *] 

92 < 9i k, 1 < £ < y/nj2k, i = 0, 1) 

A(x, y) = X{y, x) = Pfan{x, y; U[y2, £', U])- 

4. For X G U[9i,£-,ii] and y G U[y2,£,i2] 

< 92 < 9i < k, 1 < £ < y/nj2k, i\ yf 12) 

X{x, y) = X{y, x) = Pfan{y, x; U[9i,£-, ii]). 



^ For simplicity, we assume that n/k and ^/n/2k are integers. 
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Fig. 2 shows the routes in A. Fig. 2 does not show £i; 0] fan 

{z' G U[g 2 ,(-i] 1]) and y'-U[g 2 ,(.i] 1] fan(y' G U[g\,£i] 1]) for the lack of space. 
Because of the property of v-U fan, A is an edge-routing. It can be verified that 
the route degree of A is {2k + l)\/n -I- 2fc^ — 2fc — 1 = 0{k^/n) and the total 
number of routes in A is {k"^ + 2k)n + {k^ — 2k"^ — k)^/n = 0{k"^n) 

From the definition of A, the next lemma holds. 

Lemma 4. Let G = {V, E) be a k-connected graph on which the routing A is 
defined. For any node x and any node set Ug such that x ^ Ug, there are k node 
disjoint routes from x to k nodes in Ug. 

Theorem 2. Let G = (V, E) he an n-node k-connected graph such that n > 4k^. 
The routing X on G is ( 3, k-1) -tolerant. 

Proof. Let F be a faulty set with |F| < k and let R = R{G, X)/F. Since |F| < k 
and the number of node sets Vg is k, there is a node set Vj such that G < Vj > 
contains no elements of F. Let x and y be arbitrary non-faulty distinct nodes in 
V-F. 

[Case:l] Suppose that x,y G Uj. Since Uj C V/ and the route \{x,y) is 
defined in G < V/ >, X{x,y) is fault free. Thus, disn{x,y) = 1. 

[Case: 2] Suppose that x G Uj and y ^Uj. From Lemma 4, there are k node 
disjoint routes between y and k nodes in Uj. Since |F| < k, there is a fault-free 




Fig. 2. The routing A. 
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route between y and an node, say w, in [//. Since \{x,w) does not contain any 
fault in F, disR{x,y) = 2. 

The case that x ^ Uj and y ^ Ui can be proved similar to Case:2 by using 
Lemma 4. 

For an n-node /c-connected graph, if 2k"^ < n < 4/c^, set \Ug\ = 2k instead 
of \Ug\ = in the definition. In this case the total number of routes in A is 
0{k'^) = 0{k"^n) and the route degree of A is 0{k'^) = 0{k^Jn). Therefore, the 
following theorem holds. 

Theorem 3. Let G = (V, E) be an n-node k-connected graph. If n > 2kf , we 
can construct a (3,fc — l)-tolerant bidirectional edge-routing such that the total 
number of routes is 0{k^n) and the route degree is 0{k^/n). This routing is 
optimal with respect to the route degree of 0{k^/n) if n> 2k“^ and k = o(n^/®). 

Proof. The optimality of A can be shown as follows. If fc = o(n^/®) the least 
number of routes in (2, k — l)-routings with route degree of 0{k^/n) is 
from Lemma 1. On the other hand, if A: = o(n^/®), the total number of routes in 
A is o(n^/^). 

In the case that k = 2, we can construct a (3, 1)-tolerant bidirectional edge- 
routing such that the total number of routes is 0(n) and the route degree is 
0{y/n) for any n-node biconnected graph with n > 5 [12]. 

4 Optimal Routings for Biconnected Graphs 

As long as the authors have known, there do not exist (2, A: — l)-tolerant routings 
for fc-connected graphs such that their route degree are 0{k^/n). Although the 
case that A: > 3 is still open, we show that we can construct a (2, l)-tolerant 
edge-routing for biconnected graphs such that its route degree is 0{^/ri) and the 
total number of routes is 0{n^Jn). This routing is optimal because it is (2, 1)- 
tolerant. From Theorem 1, in order to define (2, l)-tolerant routings with route 
degree 0{^Jn), the total number of routes must be I2{n^Jn). Thus, the total 
number of routes in the routing shown here attains the lower bound. 



4.1 Paths by Using s-t Numbering 

Optimal routings for biconnected graphs are based on s-t numberings [5] which 
characterize biconnected graphs [12,14]. 

Given an edge (s, f) of a biconnected graph G = (V,E), a bijective function 
g : V ^ {0, 1, . . . , |U| — 1 = n — 1} is called an s-t numbering if the following 
conditions are satisfied: 

1. g{s) = 0, g{t) = n — 1 and 

2. Every node v £ V — {s,t} has two adjacent nodes u and w such that g{u) < 
giy) < g{w). 
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In what follows, we assume that the node set of G is s-t numbered and it is 
denoted by {0, 1 , . . . , n — 1}, where s = 0 and t = n — 1. 

For a node v in G, we define two paths Pi[v, t] and Pd[v, s] as follows: 

(1) Pi[v,t] = {vo{= v),Vi,...,Vp{= t)), 
where Vi = max{u\u € NG(vi-i)}(l < i < p) and 

(2) Pd[v,s] = {vo{= v),Vi,...,Vq{= s)), 
where Vi = min{uju G A^G(fi-i)}(l < * < ?)- 

Since we treat unidirectional routings, we consider directions for undirected 
paths. Therefore, for example, Pi[v,t] denotes the path from v to t and the path 
from t to u of the same one is denoted by P/[t, v]. 

Note that if (v,s) and (v,t) are in E, Pjj[v,s] = (v,s) and Pi[v,t] = (v,t) 
from the definition. 

From the definition of the s-t numbering, two paths Pi[v, t] and Pd[v, s] are 
well defined and Pi[x,t] and Pd[x,s] are node-disjoint for any node s,t). 

We define the following concatenated paths with P/s and Pds. Let x and y be 
arbitrary distinct nodes. Ps[x,y], Pt[x,y] and Pst[x,y] are defined as Pd[x,s] © 
Pd [s, y] , Pi [x, t] 0 Pi [t, y] and Pd [x, s]-{s,t)- Pi [t, y] (if x < y) and P/ [x, t] ■ {t, s) ■ 
Pd [ siy] (if X > y), respectively, and they are called s-path, t-path and st-paths, 
respectively (Fig. 3). 



st-path:PD[a;, s] • (s,t) ■ Pi[t,y]{x < y) 




Fig. 3. Three kinds of paths, s-path, t-path and st-path. 



4.2 Optimal Routing p 

Let G = (V, E) be a biconnected graph with n nodes. Assume that n > 18 and 
V is divided into groups of 18 nodes each and the last group made up of 

the remaining (n mod 18) nodes. Each group except the last one is divided into 
two parts with 9 nodes each. 

For a node v € V, let q = v div 18, r = v mod 18, g = r div 9 and £ = 
r mod 9. Each node v is represented as [q;g,{i,j)]j where 0 < <7 < [n/18j, 
5 = 0, 1 and (i,j) is the ternary representation of ^ (0 < i,j < 2). 
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II 

o 


iy — 1 


iy ~ 2 


o 

II 


Ps[x,y] 


Pst[x,y] 


Pt{x,y] 


\jx — 1 


Pt[x,y] 


Ps[x,y] 


Pst[x,y] 


II 


Pst[x,y] 


Pt[x,y] 


Ps[x,y] 



where x = [q^-,g^, jr^)] y = Wy; 9y, i^vJy)]- 



Fig. 4. The routing p. 



We define the routing p for G based on the ternary representation (z,j) of 
£ of each node as follows: Let x and y be represented as [qx', 9x, (ix, jx)] and 
[qy] Py, {iy, jy)] 5 respectively. The route p{x,y) is defined as shown in Fig. 4. 
The route from a; to y is determined based on and iy. For example, if jx = 0 
and iy = 2 then the f-path from x to y is used to define p{x, y) and jx = 2 
and iy = Q then the st-path from x to y is used to define p{x,y) and so on. 
The routing p is well-defined. It is a unidirectional and n{n — l)-routing and its 
route-degree is n — 1. 

The following lemma is crucial to show that the routing p is (2, l)-tolerant. 
Let I[q,g] = {[q\g, (i,j)]|0 < i < j < 2}, where 0 < g < \n/l^\ - 1 and g = 0, 1, 
and let I[q\ = I[q, 0] U I[q, 1]. 

Lemma 5. Let G = (V, E) be a hiconnected graph on which the routing p is 
defined. For arbitrary distinct nodes x and y and arbitrary two kinds of paths 
a-path and b-path(a, b € {s, t, st} ), there exists a node z € /[g, g] such that p(x, z) 
is a-path and p{z,y) is b-path 

Proof. It can be shown by case analysis and it is omitted in this version. 



Theorem 4. Let G be an n-node biconnected graph such that n > 18. The 
routing p on G is (2, 1) -tolerant. 

Proof. Let / be any fault and let R = R{G, p)/{f}. Let x and y be arbitrary 
distinct nonfaulty nodes in G. Except that (x,y) € E and / = (x,y), we can 
assume that / is a node because if / is an edge we can consider that one of the 
endpoints of / is faulty. We write / G [a, 6] if / G E and a < f < b. 

Since there is one fault in G, either /[0,0] = {[0; 0, (i, j)]|0 < i < j < 2} or 
/[0, 1] does not contain /. Without loss of generality, we can assume that L[0, 1] 
is fault-free. Note that /[0, 1] = {9, 10, . . . , 17}. There are 12 cases according 
to the locations of x, y and /[0, 1] and they can be similarly proved by using 
Lemma 5. We show one case that x < /[0, 1] < y(it means x < 9 and 17 < y). 

Suppose that x < I[0, 1] < y. If / = (x,y) G E, then from Lemma 5 there 
is a node z G 7[0, 1] such that p{x,z) is an st-path and p{z,y) is a t-path and 

^ It may be possible that x = z or y = z. It can occur that x,j/ G I[q,g]. In this case 
an empty path is considered to be a(6)-path, respectively 





Optimal Fault-Tolerant Routings for fc-Connected Graphs 311 



they do not contain /. If / G [s = 0, x — 1] then from Lemma 5 there is a node 
z G ^[0)1] such that both p{x,z) and p{z,y) are t-paths and their routes are 
fault-free. If / G [x -I- 1,7], then from Lemma 5 there is a node 2 G /[0, 1] such 
that p(x, z) is an s-path and p{z, y) is a t-path and they are fault-free. The cases 
that / G [18, y — 1] and / G [y + l,t = n — 1] can be proved similarly. Therefore 
disR{x,y) < 2 . 

The routing p is not an edge-routing because there is a case that (x, y) € E 
and p(x, y) is defined as an st-path. However, it can be changed into an edge- 
routing as follows. Since both an s-path and a t-path from x to y become an edge 
if (x, y) € E from the definition, if p(x, y) is defined by an st-path and (x, y) G E, 
then p{x,y) is defined as the edge (x,y). We can show that the modified p is 
(2, l)-tolerant. 

In the proof of Theorem 4, we only use the routes between nodes in /[O], 
from nodes in /[O] to other nodes and from nodes not in J[0] to nodes in /[Oj. 
Thus, we can obtain a (2, l)-tolerant unidirectional edge-routing pi in which the 
total number of routes is 0{n) as follows, 
routing pi 

Let X and y be represented as [g^,; (ix,jx)] and [g^; py, (iy,jy)] , respectively 
same as in p. p{x,y) is defined as shown in Fig. 4 if (g^, = Qy = 0), {qx yf 0 and 
qy = 0) or = 0 and qy yf 0) 

Theorem 5. Let G he an n-node hiconnected graph such that n > 18. The 
routing p\ on G is (2, 1) -tolerant edge-routing with 0{n) routes. 

4.3 Optimal Routing with Route-Degree o(n) 

We construct a (2, 1) -tolerant routing p 2 with route degree 0{^/ri) for n-node 
biconnected graphs by modifying the routing p. 

Let G = {V,E) be an n-node biconnected graph. We assume that each node 
X in G is denoted by [g^; 5x, (G) jx)] same as in p. For simplicity, there is an 
integer £ such that n/18 = . This condition can be easily removed and it will 

be shown in the final version. Thus, g^ can be represented by (g^,g^), where 

1 < = vWl8- 

routing p 2 

Let X and y be represented as [g^ = {qxjQ^)'-!9x, (G,Jx)] and 
% ~ lily j Qy)'-! 9yj {"^yGy)] ? respectively. p 2 {x,y) is defined as shown in Fig. 4 if 
qx = qy or g^ = g^. 

In the routing p 2 , the route p 2 {x,y) is defined if x and y are in the same 
interval I[g] or the right part q^ of g^ and the left part g^ of qy are equal. From 
the definition of p 2 , we can verify that the route degree is 0{y/n). We can show 
that the routing p 2 is (2, l)-tolerant by using Lemma 5 and the following lemma. 

Lemma 6. Let x = [g^ = (g^ , g^ ); 9x, (G, jx)j and 

y = [qy = {Qy !Qy)'j 9y!{'iy!jy)\ be arbitrary distinct nodes of G on which p 2 is 
defined. Then, one of the following conditions holds. 
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1. Qx = Qy, that is, X and y are in the same group I[qx = Qy]- 

2. Qx = qy , that is, the routes from x to nodes in I[qy] are defined. 

3. There is a group I[qz] such that qff = qf and qfi = q^, that is, the routes 
from X to nodes in I[qz] and from nodes in I[qz] to nodes in I[qy] are defined. 

The total number of routes defined in p 2 is 0{n^/n), because the route degree 
of each node is (\/n). From Lemma 1 the total number of routes is at least 
Q{ny/n) to define (2, l)-tolerant routings with route degree 0{y/n) The routing 
P 2 attains the lower bound of the total number of routes. 

Since in the case that n < 18 we can construct a (2, l)-tolerant edge-routing 
for n-node biconnected graphs[12], the following theorem holds. 

Theorem 6. Let G he an n-node biconnected graph. We can construct (2,1)- 
tolerant edge-routing on G with 0{ny/n) routes and route degree 0{y/n). 



5 Concluding Remarks 

We have shown two optimal edge-routings with smaller routing tables. It is 
an interesting open question whether or not there exists an (2, k — l)-tolerant 
routing with route degree 0{ky/n) for n-node /c-connected graphs(fc > 3). It is 
also an interesting open question whether or not there exists an (2, l)-tolerant 
bidirectional routing with route degree 0{y/n) for n-node biconnected graphs. 
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